Skip to main content

Does massage therapy reduce cortisol? A comprehensive quantitative review

As an introductory piece to the journal club, this article was chosen for a number of reasons. 

As full coverage of the questions that should be used as a guideline for choosing an article, here is a list of the reasoning behind this selection:

 

Describe WHY you picked this paper

Because I want to.  OK?  You gotta problem with that? smiley

First of all, it deals with a subject on which many of us are familiar.  The "cortisol claim" has been taught in many massage programs and it is highly probable that a number of massage therapists believe that it is the underlying reason for some of the positive effects of massage.

Secondly, it is a claim that is taught in many basic massage programs, independent of what level of advanced training has been achieved and so it is relevant to all massage therapists, no matter what career path they make take.  This information is just as relevant to the therapist that works in a spa who is educating their clients on the benefits of massage as it is to the therapist that works in a medical setting.  The subject matter is important in aiming to understand the causal physiological mechanisms of massage therapy.

Thirdly, it is an example of how important it is for us to keep up to date with new research and how important it is for us to be aim to develop skills on to how critique studies. An understanding of the differences between group and within group analyses are very important if we really want to be sure that a claim is valid. The article also shows that information from different researchers may contradict each other - which is dependent on the type of analyses carried out. 

The article was also chosen because it is an *easy read*.  It may be longer than other articles that will be used in the journal club in the future, but I think that most therapists will find it interesting on the different types of studies that have been chosen and will ask questions on the methods used. 

 

Explain how you came across the article

The research was presented at the MTF conference in Seattle in 2010.  The main author's other articles are also available on a faculty webpage.

The main author has provided a video (the second part is provided in a link) and a presentation on the research and he explains the differences in between-group and within-group methods graphically.  It is recommended that journal club users watch the videos first, then read the article at least once by clicking on the pdf file below (it's a good idea to read an article 3 times, with a break in between each session) and then ask questions.

 

Briefly describe the study

It takes already published studies and examines the levels of cortisol levels using a between group analyses.  This is important since even though a "within group" analyses may indicate that massage reduces cortisol, when between group analyses is used, there is little evidence that this is the case. 

 

The other questions are covered in the comments below. 


NOTE:  the main author of the article has given permission to use this in the journal club as an educational piece. 

If you'd like to read along with the author as he speaks in the video, click here to have a transcription of part 1 of the video open in a new window.

video: 
See video
See video

Between Group/Within Group Effects

The video clearly shows one major consideration that all MTs should be able to understand the differences between within-group and between-group analyses.  At around 2.52 in the first video the explanation is graphically represented and discussed.  If anyone is unsure about this, go to that area of the video that covers the bell curves for the before and after data. Basically, it boils down to this:  if you understand the concepts described between 2:52 and 6:50 on the first video (and understand how time, placebo effects and other confounds can come into play and the reasoning behind the use of a control group), then you understand the use of a control groups in research and you are already probably have a better understanding of research than 95.8% of the general population (probably - but that just came off the top of my head cheeky ).

 

The authors mentioned statistical regression, but don't go jumping into that statistical pool just yet if this is all new to you.  The reason why is that sometimes people can be put off by the "Oh dear, I have no idea what that means" terminology at first.  When we first start looking at research the lingo that is used will be foreign to most, but all it takes is exposure.  If you started talking to *normal* people about proproception, ischemia etc, it would probably have the same effect on them.  Since most of us got over that hump, you know that you can get over the research lingo hump.  All it takes is exposure.

The main thing is this:  if this is all new to you....

 

So getting back to the Journal Club general questions.

Describe the research question using the 4 basic components of the question (PICO):

Population  (who was studied?)

The population included adults and children in already existing studies, selected by a well defined inclusion criteria.  NOTE:  since this is "research on research", the population reference is slightly different.

Intervention (what therapy was applied, tests etc?)

A range of studies were included and the massage interventions mostly included basic massage strokes, applied for a range of times.
 

Comparison or control (how was the intervention controlled?)

The inclusion and exclusion criteria are stated on page 6 of the study.  As for each study, they had different types of control e.g. progressive muscle relaxation to attention.
  

Outcome

The outcome for this particular study was that when between group analyses were completed on compiled data from numerous studies, there was no significant increase in cortisol levels over control groups in studies done on adults.

 

How they aimed to control for bias in their own study

The main way they controlled for bias in this study is that each of the researchers coded studies independently when deciding which studies should be included, then comparisons were made on the agreements.  The agreement rates are stated in the study.  

What is the importance or relevance of the question asked?

The importance is that many therapists have been taught "massage reduces cortisol".  This new study seems to refute that claim.  Teachers should be aware that this claim is now questioned and therapists should be aware of this study if they are making claims to their clients or in marketing materials.  MTs should also be aware that when one researcher makes a claim, another study may refute it.  Understanding of different statistical analyses is important for the MT to ensure that they are providing the correct information to their clients.

The other main question that comes from this study is:  if the cortisol claim is now in question, what other mechanism is coming into play?

State your questions (and answers, if you wish) on the threats to validity in the study.

Questions are in the comments below - with reference to the study itself.

Summarize the results.

The claim that massage reduces cortisol is brought into question.  The strength of this study lies in the analyses done, with comparisons made to the control group and not within group.

Describe the implication of results in clinical practice.

Is the "reduction of cortisol" claim now debunked?  Should massage therapists claim that massage reduced cortisol?

 

Basic questions on the study

Because many of us are not familiar with statistics, I think it best to do the discussion in 2 parts - the second part dealing with the analyses and statistics.

My recommendation is that everyone follow and discuss the first part of the study to get a clear understanding the methods used by the study.

When you first read the study, some may find it strange to see what seem to be contradicting findings within the research world.  First of all, is everyone aware of why the TRI studies are coming to a different conclusion than the others? 

One thing that I think needs to be discussed is what the authors write about this study:

 

 

I believe that one of the main questions would surround the references to % change.  Is everyone in agreement with the authors in that % change is inadequate for this type of analysis and does everyone understand why they make this claim?  Also, this also demonstrates how two different researchers can handle data in a different way.

Does anyone have any comment on this?

 

OK, so I'm not going to go into the statistical analysis yet until there has been a discussion on what I view as one of the most interesting parts of the study, and that is Table 2.  This lovely table pretty much sums up the information from the studies that met the inclusion criteria and from an MTs point of view, it's fascinating for a number of reasons:

  • it breaks down each study into consoliated information that is easily followed by using the key at the bottom of the table
  • you can see the wide spread of the different types of studies included - from dance students to Parkinson's patients
  • you can see the types of massages done, whether the person(s) were trained that were providing massage, how many minutes of massage was given and what the control was for the studies (e.g. relaxation therapy).

 

Does anyone have any comments on the information on that table?  (NOT the statistics, just the, but everything to the left of "Between groups effect sizes").

Just to show the table again, here it is:

 

 

I think there is quite a bit to take in on up to this table, so I hope some questions and general comments will be raised.....

 

 

So how many of you were taught that massage reduces cortisol?

I'm just curious if some of us have not been taught the claim or has everyone heard it in the classroom?

I will confess

 

Not only have I heard that claim in the classroom, I've *taught* it in the classroom.

Now, in light of this new evidence, I don't do that anymore--but in my experience, it is widely believed and taught.

I think its evident that its a widely taught claim by websites

I confess that I had the statement on my "benefits of massage" business page and I've noticed a lot of other therapists have it on theirs too.

Confession is probably good for us! laugh

I learned a lot from this study

I first learned about this study around the time it was presented at the conference. I'd been following TRI's research for years and was at first shocked and then curious. Coincidentally, shortly before I head of it I'd run across some studies myself on PubMed that showed little or no change in cortisol and, in one case, a rise in cortisol levels. I thought that curious but ignored it at the time. 

My initial reaction was, "Whaaat!?!?!? What about all those TRI studies?" I was able to get a copy of the study and, since I don't understand statistics, didn't understand it but I got the general idea. When the videos came out, it helped me to understand the difference between in group change and between group change. After that, I wondered why TRI had not taken that into consideration?

The study not only taught me about different types of comparisons and meta-analyses, but it also reinforced the importance of not being attached to particular ideas. 

 

I'm glad to see folks are

I'm glad to see folks are using Ravensara's site.  (Also, she's too modest to say it, but the site was created with a lot of input and support from Rosemary Chunco.)  I'm also honored that you are discussing this study.

I don't think anyone should be too upset that they were previously claiming massage reduced cortisol.  It's kind of logical that it ought to do so, since cortisol is a stress hormone, and since massage is demonstrated to reduce anxiety and depression, which are stress-related conditions.

But the data are clear.  Massage must be doing that in some other way.  Also, it is important for us to remember that no physical or neurological variable, such as cortisol, is a direct correlate of any psychological state.  The feeling of being stressed is the result of a complex interplay of physiological factors within our body, and cortisol level is just one of them.

Also, we need to remember that "stress" is any demand made on the body, including demands we perceive as positive.  Massage itself is a stressor, and as such it might even cause cortisol to rise in some situations, even as we perceive the massage to be pleasant and beneficial.

-CM

Massage and cortisol as CAT (Critically-Appraised Topic)

Coincidentally, shortly before I head of it I'd run across some studies myself on PubMed that showed little or no change in cortisol and, in one case, a rise in cortisol levels.

 

Interesting, Alice--I will look for them.

The topic seems to be of enough general interest that it could be the first CAT here. I am designing a poll to find out which subjects are of more interest that others to know where to begin, but massage and cortisol are pretty clearly of wide interest.

Are you planning to analyze massage's effect on catecholamines?

I'm carrying out an RAA (Research Article Analysis) carried out by members of Field's team on the effects of vibratory stimulation and hand and foot massage on fetal movement (ETA this afternoon).

One of the statements in the literature review is:

Massage therapy has been shown to affect maternal cortisol and cathecholamine[sic] levels

 

I am assuming that massage's effect on catecholamines--fight-or-flight hormones like epinephrine and norepinephrine, plus the neurotransmitter dopamine--was analyzed in the same way as the effect on cortisol was, and that the methodology is therefore just as problematic.

So in the discussion on that RAA, I will mention this study, and say that the cortisol claim has been contradicted, and that the catecholamine claim likely suffers from the same methodological flaws.

I was just wondering if you have any plans to explicitly carry out the same kind of analysis for the catecholamine claim.

Good stress (eustress) and bad stress (distress)

Also, we need to remember that "stress" is any demand made on the body, including demands we perceive as positive.  Massage itself is a stressor, and as such it might even cause cortisol to rise in some situations, even as we perceive the massage to be pleasant and beneficial.

 

Excellent point, Chris. We often use the word "stress" in the sense of bad stress only.

Hans Selye, the endocrinologist who laid the foundation for the study of the stress response, distinguished between eustress (good) and distress (bad). That's a very useful distinction to keep in mind, and we often blur over it by just saying "stress".

I don't have current plans to

I don't have current plans to examine the catecholamine claim.  I have several other projects I want to pursue more, and there are only a small number of studies with catecholamine data.  You are right that they have been analyzed and presented in the same problematic way.  -CM

Selye's conceptualization of

Selye's conceptualization of stress is the one I find most useful.

I particularly like Selye's technical term for the total absence of stress - death.

-CM

Selye

I particularly like Selye's technical term for the total absence of stress - death.

 

Excellent :)

So that we're all on the same page, would you mind briefly summing up Selye's conceptualization of stress?

I imagine people are familiar with some of the ideas, because of the influence they've had, even if they haven't formally studied who came up with and investigated them.

Stress vs Anxiety

Thanks for creating this Raven; it's looking great. Some very good comments were made here and I'd just like to add that it might be wise to define anxiety as well as stress in this discussion. The word "stress" in our current culture has become a "tag" word that almost everyone uses to promote their business. It has a broad umbrella and a different meaning for each individual. Most of whom say they have stress in their lives, however, are hesitate to use the term anxiety, as it carries a more clinical meaning.

That's an excellent point about defining our terms

Thanks for bringing it up, Robin.

I will see what the consensus is among psychologists; I think we should leverage the work our colleagues have already done, and get back with what I find out.

Addressing stress with the public's perception

Thanks Raven; it will be interesting to see the scientific consensus of the definitions of stress and anxiety, but more importantly for massage therapists may be how to address the public's perception of stress (which may in fact be anxiety). We should come up with a working definition in the MT industry but how does that translate to addressing the public's actual meaning?

wish there was a like button for comments

Since I would like Robin's comment. Does "stress" not imply an environmental trigger for emotional states, whereas anxiety can exist without an environmental trigger and can be a chronic?  (whereas stress is more prone to be a temporary state)?

Also, is the emotional states not a bit different in each?  I would put nervous behavior and worry as a state of anxiety, whereas being overwhelmed or frustrated is more in line with being stressed.

I think as an MT, being aware of a client having an actual clinical diagnoses of having anxiety (or depression, for that matter) is pretty important and we need to be cognizant of *regular* people who describe themselves as being stressed, may just mean that they have a busy schedule at that time.  In fact, could it be that in some situations, some people actually thrive in being slightly stressed in that sense?

Also, I don't think there is enough attention paid to the effects of anxiety can have on someone physiologically. 

Yes, we are talking about several different perspectives here

Does "stress" not imply an environmental trigger for emotional states, whereas anxiety can exist without an environmental trigger and can be a chronic?  (whereas stress is more prone to be a temporary state)?

 

That is certainly a reasonable distinction to draw in terms of how people may understand the meanings of the terms in their own lives.

And yet, as you are both getting at, that may be a very different way from how scientists interpret the terms, which may also differ from how clinicians interpret the term from the clinician side, as opposed to the client/patient side.

My vertebrate endocrinology prof, John Wingfield, liked to point out how, although we tend in our thinking and conversations, to regard "stress" as a bad thing, that it is only chronic stress that harms us. Acute stress ("flight-or-flight", "flee and pee") is actually very adaptive, as it can save our lives.

Yet, although that is factually correct, we cannot assume that that is how our clients are thinking about talking about it in relation to their own lives. In addition to understanding the science behind what is going on, it is also important to be clear on what someone means when they are telling us what they experience going on in their lives.

It's not for nothing that this is called "translational" healthcare, after all. And how we do it is very important, and you both are asking the right questions to get started on that journey.

STRESS VS ANXIETY

It's my understanding that an "anxiety attack" is characterized by rapid pulse and difficulty breathing.

Stress (or rather, negative stress, or too much stress) produces different symptoms in different people.  It may appear as tension headaches, stiff muscles (particularly the neck), insomnia, digestive upsets. 

I'm interested to know what Doc may say about this

No shame in having believed in cortisol reduction

Chris Moyer wrote: "I don't think anyone should be too upset that they were previously claiming massage reduced cortisol. "

Thanks for saying that, Chris. I felt like the "confessions" were unnecessary. I wasn't taught it in school because I went to school before TRI existed! However, when I first learned about TRI I was very excited about following their studies because for (essentially) the first time we had someone bringing research into the field of massage therapy. I even met Dr. Field when she came to St. Louis and presented a talk to a conference on infant massage therapy and wrote an article for a newspaper about it.

I feel absolutely no shame about any of that at all, not one whit. It was plausible and it was the best science we had available at the time.

And, new evidence requires us to adapt our thinking. No big deal. Okay, so it's a big deal for anyone running a cortisol reduction service but for the rest of us, it means we adjust our thinking and how we communicate to clients and we just keep making them feel good as best we know how.

The shame would be if this claim *continues* to be perpetuated in light of the evidence we now have. Every massage therapist should know about this study.

I also thank Chris for pointing out that massage is a stressor. Zhenya taught us this and understanding this guides some of our decisions, in Russian massage, about what to do, when to do it, how much to do it, and when not to do anything. People do not think of massage as a stressor because it feels good, but we are interfering with the processes of the body, we are effecting a change, we are making the body adapt to external input. We are creating a stressor.

I still don't understand beans about the numbers and I'm not sure I want to. Yet. Maybe I'm a lazy bum. I want someone else who knows what they mean to just tell me, at least for now. I'm working enough at wrapping my head around the rest of this information which, since it comes in words, is much easier to understand.

p.s. Raven: I think the study that showed a rise was done in a Scandanavian country, but I found it on PubMed while looking for something else.

OK - so a discussion on methods first......

Since we're now "live"  (Yee haw!), I want to get a bit of a discussion going on the actual methods of the study first, before we go on to looking at the statistics.

Due to the fact that this is a 'research on already done research", where the data is gathered from existing studies, we would need to look closely at the studies that were involved to ensure that there was relevant inclusion and exclusion criteria, and also make sure that enough databases were used in the search for relevant studies.

First of all, I want to address "how the studies were picked".  You should also bear in mind that the main author has contested the cortisol claim before this study.  Why does this matter - because of that old thing called cherry picking.

OK, so lets's take a look:

Searches were done using "massage cortisol: in the following: CINAHL, Dissertation abstracts (which I'm unfamiliar with) , Google Scholar , PschInfo and PubMed.

After an initial scan, they got 173 which needed closer inspection.

OK, so this is where the "picking" gets more detailed.

They picked ones that met

  • their operational definition of MT  (ok, now look at the chart above and see if the MTs were trained - any comment on that?)
  • provided graphical and numerical data on the effect of MT on cortisol levels on humans
  • used random assignments of participants to MT conditions and one or more control conditions (which vary - look at the chart again)
  • reported results not duplicated in another retrieved article (why?  any comment on that)

 

So using this, the 173 was whittled down to 18 articles, containing 19 studies. 

Now we go on to coding of those studies.  Before we go on to that, I just wanted people to look at the study data again and get their feedback on:

  • Do you think that the inclusion criteria suited the question asked?
  • Do you think the exclusion criteria was explained enough after they got the 173 studies?
  • Looking at those studies again  on the chart - do you have any comment on the possible relationships between *timing of massage* and the method of cortisol measurement.  In other words - do you think that measurement of blood cortisol might be different in a 10 minute massage if it is compared to measurement of salivary cortisol?  Also, look at the variations in the timing - 10 minutes up to 60 minutes.  Do you think that maybe timing of the session would have some major influence?
  • Don't you just love this stuff?

 

Regarding whether those

Regarding whether those giving the massage were trained - most were. It appears that in some cases it is not known. In the one study using untrained people, I'm assuming it's the parents of the children with RA based on that I know TRI has used parents giving massage to their children in at least one other study.

As for the timing of when cortisol was measured, I don't know enough about it to know what would be considered appropriate timing.

"reported results not duplicated in another retrieved article" I'm not sure I understand what that means. Does that mean if TRI did a study and got certain results and I did a similar study and got similar results, that my study would not be included? Or does it mean that if I did an evaluation of, say, three studies, including one done at TRI, and the one done at TRI was included in another article, that only one of them would be used because it's a duplicate i.e. you do not include the same study twice?

Regarding the timing and the method of measurement

What I wondered about was since this is determined by speed of ANS response (in a way), then is the factor of time and measurement not something to be considered. 

Only one of the study used blood analysis - the rest used urine or saliva.

I would have thought that urine testing would require more "time" to have an effect on what it is that we're measuring.

Also, there is also the question of instrument validation.  Is one method better than another?

I realize that this is nit-picking on the individual studies themselves, but since this is a case of research on research, the top level research is heavily dependent on the the quality of the studies that are included in it.

 

Here's something else that I'm throwing out here just to see what people think.  There very rarely seems to be any reference to whether or not recipients have received massage previous to the start of types of studies.  Surely this would have a major impact?  Even if you had a recipient who had been getting regular massage for a period of time, then had stopped, it would be a totally different experience for them over someone who was experiencing it for the first time as part of the study - and that different experience could throw a spanner in the works.  There is also the fact that effects may be larger for those who receive massage regularly.  It could be that adequate sample numbers might take care of that, but at the same time, I can't help but think that if 25% of the sample were receiving massage for the first time, it would have different findings from 75% receiving massage for the first time in the trial.

But it's just a thought and it might be the wrong way of thinking of it.

absolutely right

What I wondered about was since this is determined by speed of ANS response (in a way), then is the factor of time and measurement not something to be considered. 

 

Yes, this is very important. The experimental design needs to take this into consideration when determining the measurements, or else, the measurements will be inconsistent, and the researchers will be comparing apples to oranges.

You may have heard of the concept of "half-life" in connection with radiation. "Half-life" is the length of time it takes half of the amount of material present to decay--in the case of radioactivity, to turn into a different (hopefully, harmless) element. In the case of biochemical and pharmacological compounds, the component chemicals re-form and rearrange into different chemical compounds.

This next part is a little bit counterintuitive: what "half of the amount of the material present" means is going to change each time we measure it.

Let's say we have 16 units of a material that has a half-life of one day. After one day has passed, half of the amount of material present is gone, so we have 8 units left.

Then, the amount of material present is 8 units, and half of the amount of material present is 4 units, so after the second day has passed, 4 units of the 8 are gone, and we have 4 units left.

So then the amount of material present is 4 units, and half of the amount of material present is 2 units, so after the third day has passed, 2 units of the 4 are gone, and we have 2 units left.

In the same way, after the fourth day has passed, only 1 unit of the original 16 is left, and then we start getting 1/2 a unit, 1/4 a unit, 1/8/ a unit, and so on.

 

 

I remember in my vertebrate endocrinology class, my prof mentioned that the half-life for blood cortisol is rather short--like an hour or so--so they would have had to measure it very soon after the intervention, or else it would drop off too much too fast to get an accurate reading. I tried Googling to get the exact number, but no success with that. But you get the idea.

Only one of the study used blood analysis - the rest used urine or saliva.

I would have thought that urine testing would require more "time" to have an effect on what it is that we're measuring.

 

Yes, it takes longer to get into the urine or the saliva than into the blood, so along with the half-life, which body material it's in will have an effect on when to measure it.

Back when I was working on the bear vaginal cytology data to determine their reproductive cycle, I didn't have much to do with the endocrinology side of the study (reproductive hormones, not cortisol). However, blood sampling would have been too invasive, and even saliva would have been hard to get. Urine's too hard to collect, since it soaks right into the ground, so poop was the vehicle of choice for measuring circulating reproductive hormone levels in bears. Like I said, though, I didn't participate in the poop collection or hormone measurement.

Also, there is also the question of instrument validation.  Is one method better than another?

 

I don't know the answer to that, but I do know that within one study, it is important to pick one way to validate it and then stick to it. I suspect, but don't know, that to compare the studies, they had to validate in similar ways; otherwise, the data would just be too different to compare.

I realize that this is nit-picking on the individual studies themselves, but since this is a case of research on research, the top level research is heavily dependent on the the quality of the studies that are included in it.

 

Exactly right.

There very rarely seems to be any reference to whether or not recipients have received massage previous to the start of types of studies.  Surely this would have a major impact?  Even if you had a recipient who had been getting regular massage for a period of time, then had stopped, it would be a totally different experience for them over someone who was experiencing it for the first time as part of the study - and that different experience could throw a spanner in the works.  There is also the fact that effects may be larger for those who receive massage regularly.  It could be that adequate sample numbers might take care of that, but at the same time, I can't help but think that if 25% of the sample were receiving massage for the first time, it would have different findings from 75% receiving massage for the first time in the trial.

 

Interesting question. I don't know the answer to that. I have seen studies where they instructed the clients not to get massage outside of the study for the duration of that study, but I don't think I've ever seen people ruled in or out on the basis of whether they have had massage before.

duplication versus replication

"reported results not duplicated in another retrieved article" I'm not sure I understand what that means. Does that mean if TRI did a study and got certain results and I did a similar study and got similar results, that my study would not be included? Or does it mean that if I did an evaluation of, say, three studies, including one done at TRI, and the one done at TRI was included in another article, that only one of them would be used because it's a duplicate i.e. you do not include the same study twice?

 

Yes, you wouldn't want to include the same study twice.

On the other hand, if you did a similar study, and got similar results, that's not counting the same thing twice, but rather you are replicating the results of the other study. That's actually useful as confirmation.

OK, so here's something interesting.....

I wanted to take a closer look at *when* the samples were taken, since that seems like a significant thing to consider.

so I went and looked at one of the studies on here and this thought this was interesting:

 

OK, so, the timing should matter in the salivary tests.

BUT, I would have liked to have seen a reference in here to see where this information has been verified (that it reflects cortisol levels before).

So - by taking samples 30 minutes AFTER the session, it's an actual measurement of what was going on 10 minutes after the end of it.  Would taking it 20 minutes after not give a better reading?  Or does it not matter?

 

Also, I'm not sure if taking the sample 30 minutes prior is a good idea - because that is sampling  levels from stimulation 50 minutes before the start of the session.  In order to have it more controlled, would it not have been better to have all subjects do the same thing for 30 minutes before the start of the session (like sit quietly in a room or something else) and then take a swab right before the session?  That way, there is more control over the immediate environment for all participants.  But then, maybe there method had something got to do with the fact that the mothers were taking the swabs (presumably).

So one of the first things I would ask about the studies is "are they all going by this 20 minute rule on the cortisol saliva tests and if so, can that rule be validated/referenced to make sure it's right".

Also, what about those that have only 10 or 15 minute sessions?  The session itself wouldn't have lasted the lag time.  Anyone have a comment on that?

OK, so as a wee "oohhh look at that" in that same study that I found the above on, take a look at this table:

I know we haven't got into stats on the main study yet, but before we do, I think it might be a good idea to go through this chart so that people understand it's meaning.  That way, when we're looking at the stats on the "research on the research" we know what the base research stats referred to.

 

Statistics

Raven is expanding this into separate blog posts - which is excellent, since those of us that need more "digest" time can take the statistics part in a wider scope (since we'll be concentrating mostly on the cortisol levels in this one and going to the main study again shortly) and in smaller chunks.

So, here is Raven's first description of the main elements of the table, followed by what is meant by the M - the mean.

Raven also mentioned something about the measurement of cortisol (the units).  I've also seen nmol/L (nanomoles per Liter) and µg/dL in studies.  Does anyone want to see what some of the other studies use that have been referenced in the main study?  (Google scholar is great for finding full texts). 

Here is a basic description of SD (standard deviation), and don't be put off by the "kiddy look" of it  - it has some handy wee references for brushing up on basic stats (I think the last time I looked at stats was probably 28 years ago!)  (and who doesn't like references to dogs?). 

The formulas can look at bit intimidating, but knowing the relevance and "why we should know the concept of it" is the most important.

So taking a look at those pre- and post- levels for M and SD for cortisol levels for the first session and the last you can see that the mean decreased for the first session, but it actually increased for the relaxation group.  For the last session the mean decreased for both. 

I thought it odd that there was an increase in the spread (the SD) and the mean for the relaxation group after the first session, but then this is a small sample number (remember that there were 10 in the control group only).  Had there been a greater sample number, maybe this wouldn't have been the case. Am I right in thinking that the number of kids *in* the study matter and can have a major impact on the overall stats?

What other things could have been going on that could have driven up the mean for the first session for the relaxation group?

Now, the fact that I'm commenting on it doesn't mean that much, but I would have expected there to be at least some reference to this in the discussion.  But if you look at the discussion, it seems to be that the cortisol levels aren't referred to that much since previous research had already made the claim that cortisol levels decreased for the massage group. 

 

Here is a question for the researchers - how come the "raw data" is rarely given in studies?  It would have been great if the data from each subject at each session would have been available in a table in an appendix - that way we could go through it step by step.  Is it not normal protocol to do that?  Would it not help serve fellow researchers who may want to do different statistical analysis on the same data?  How can a well run peer review be done without providing that?

 

Good observations

 

Here is a basic description of SD (standard deviation), and don't be put off by the "kiddy look" of it  - it has some handy wee references for brushing up on basic stats (I think the last time I looked at stats was probably 28 years ago!)  (and who doesn't like references to dogs?). 

 

Thanks--I was not familiar with this; this is actually quite good. I'll put it in the Links section for reference.

 

The formulas can look at bit intimidating, but knowing the relevance and "why we should know the concept of it" is the most important.

 

Exactly--remember Bloom's learning taxonomy, as modified and applied in the InterJurisdictional Entry-to-Practice Competency Profile for Massage Therapists? I think that knowing the relevance and why we should know the concept of it meets their stated goals of reaching

  • Cognitive Level 2: Comprehend & apply information
  • Cognitive Level 3: Analyze and interpret information

 

in research literacy.

They don't require Cognitive Level 4: Synthesize (create new) information, which is where I would think using the formulas to generate new data would come in.

 

 

eek!

I wrote a very long, detailed reply to your questions on data sharing, and it got eaten!

I'll pick it up again tomorrow, but it's bedtime now.

Cheers!

ok, I have learned my lesson

comments can get lost when you hit the Save button.

I didn't follow my own advice to save a copy of my work, and as a result, I'm starting over again from zero on the data sharing question.

At POEM, we value your comments and questions, and don't want the same thing to happen to you. Make sure you have a copy of your work saved in case something like what happened to my comment happens to yours.

Back to data sharing, now.

data sharing--why don't articles publish raw data?

(Splitting up my answer over several shorter posts, both to avoid losing data like previously, and also because the comment that was lost was so long that it may well have caused everyone's eyes to glaze over. Perhaps losing it was exactly the right thing to happen. :)

 

Here is a question for the researchers - how come the "raw data" is rarely given in studies?

 

Several different reasons come together to cause that:

  1. Back when print journals were the only alternative, physical space in the form of page length and mailing costs was a much bigger issue than it now is, when you can include a link to supplementary information, including raw data, at a Website. Now, a real constraint at the time may live on as a habit now.
  2. Principal investigators (PIs) may consider it as their own private work product, an argument that--in the case of taxpayer-funded research--runs up against the fact that the grants supporting the research are funded by taxes, and the data should belong to those who supported the research. For PIs to use the data in for-profit ventures as their own work product represents a conflict of interest for this reason. Congress and the National Institutes of Health have tightened the rules on disclosure and publication in open-access forums for this reason.
  3. Fear of having the data you worked for stolen and misused. I actually know university biology teachers who've had their class data stolen and plagiarized, so this concern is not imaginary or exaggerated at all. It's that much more of a risk at high levels of research,
  4. Tradition and subcultures: molecular biology and genetics have traditionally been much more open-access and sharing than, for example, neuroscience.

I'm sure there are many more reasons why the situation developed in that way; those are just the ones that come to mind off the top of my head right away.

Cool

Thanks, Raven.  Those are understandable.

Now, arising from that, ensuring no data-dropping or tweaking would be key - and I'm looking at this from a consumer's perspective.  In this study there didn't seem to be any blinding at all in regard to the cortisol collection and analysis and I think that could have been easily done by getting a collector who does not know which kid is getting what treatment, give the samples numbers, when they come back from the lab, the data is also entered by someone who doesn't know who is getting what - and this could also have another blinded person there to make sure the data is entered correctly.  I know that requires more people and maybe I'm being too picky.  You can let me know.

Also, I know it wasn't in the inclusion criteria for the main study (Moyer's study, not Field's)

OK, so I went and looked at another study on cortisol measurement, just to see what they were doing time wise and also what their unit of measurement was. 

By the way, has anyone noticed that when you go and start looking for the studies themselves, even though you might think that it's a completely different set of researchers in it because of the main author's name, when you find out there is a major overlap in the names with other studies, you begin to realize that the variation in the researchers isn't as wide as you think.

I've to run on here, but I'll be back later.

 

data sharing-availability in appendix

 It would have been great if the data from each subject at each session would have been available in a table in an appendix - that way we could go through it step by step.  Is it not normal protocol to do that?

 

Not at present, no. There is a tradition in some clinical and applied disciplines, such as epidemiology (part of public health: the study of how sickness and wellness are distributed in populations, and what factors play roles in those differing distributions) where Table 1 typically summarizes the demographic data in an overview with descriptive statistics.

But no, it is not presently normal protocol, and yes, you are right--that would be quite useful.

 

 

Peer review

How can a well run peer review be done without providing that?

 

Typically, peer review operates at a higher level of analysis than poring over the data with a fine-toothed comb. If the study design is appropriate for addressing the research questions, and they've chosen the correct statistical tests for the situation, and there aren't any other glaring issues, then the peer reviewer tends to spot-check calculations, but doesn't sit down and examine every single point of raw data.

This is one reason why trust is so important in science--when someone like Werner Bezwoda [1] or Hwang Woo-Suk [2] falsifies data, and it gets past peer review, it ripples through a huge audience that trusts the peer reviewers and the process to get it right. It harms individual patients, that is true, but additionally, it harms the entire web of science. That's why it's dealt with so harshly.

Kirby Lee and Lisa Bero have advocated data sharing/data audit to cut down on cases like the above [3]. You've independently hit on the same point they are making: the contribution of data sharing to improving peer review, and through that, to improving published science.

 
[1] Why Big Lies Matter accessed 8 September 2011
 
[2] For Science's Gatekeepers, A Credibility Gap accessed 8 September 2011
 

absolutely, data sharing would promote collaboration

Would it not help serve fellow researchers who may want to do different statistical analysis on the same data?

 

That's one way collaboration could take place (although, remember, fellow researchers are also competitors to some degree); another way is to look at the same data through the lens of a different discipline--analyzing neuroscience data from a genomics point of view, for example.

There are a couple of articles that are freely available as full-text online, that explore issues around data sharing in more depth:

Abstract: After more than a decade of collecting large neuroimaging datasets, neuroscientists are now working to archive these studies in publicly accessible databases. In particular, the fMRI Data Center (fMRIDC), a high-performance computing center managed by computer and brain scientists, seeks to catalogue and openly disseminate the data from published fMRI studies to the community. This repository enables experimental validation and allows researchers to combine and examine patterns of brain activity beyond that of any single study. As with some biological databases, early scientific, technical and sociological concerns hindered initial acceptance of the fMRIDC. However, with the continued growth of this and other neuroscience archives, researchers are recognizing the potential of such resources for identifying new knowledge about cognitive and neural activity. Thus, the field of neuroimaging is following the lead of biology and chemistry, mining its accumulating body of knowledge and moving toward a 'discovery science' of brain function.
 
Two things that interest me around these issues are:
  • Is there a possibility that POEM could serve a role as a repository for storage and integration of primary data on massage?
  • What other disciplines could fruitfully share their primary data with massage, or could use primary data from massage studies for knowledge discovery purposes?

oh - nice one!

Two things that interest me around these issues are:
  • Is there a possibility that POEM could serve a role as a repository for storage and integration of primary data on massage?
  •  What other disciplines could fruitfully share their primary data with massage, or could use primary data from massage studies for knowledge discovery purposes?

 

I think if there was some way of making it easy for data collection and consolidation, it would be a good idea.  Are you only talking about already published data in journals (where there is an extraction and it would be put into tables that would be easy to see - kind of like the table in Moyer study, where it's easy to see what the study is about and key elements, but the data would be expanded out) or could you be referring to non-published data?  Or both? 

Before that happens though, I think there needs to be a level of knowledge and standards reached for research literacy and the ability to critique and analyse methodologies generally within the MT community, and that is quite a way off.  That also goes for case studies too.  Also, there's a quality issue. I always thought it would be really good if journals were given "standard stars" - a 1 star would mean that the level of scientific rigor for the journal isn't all that great.  10 stars would mean it's pretty good!  If the same were given for studies, that might also be a plus - but that's just me and I know that would be impossible.  But, if therapists kind of have their own star system in their heads, it would be good.  You referred to trust among peer reviewers - I think massage therapists are way too trusting in what they read in our magazines and the like about research.  That's a problem in my view (and in society in general).  So, yeah, I think it's a great idea, but concentrating on that right now might be putting the cart before the horse.  The way I look at it, is that POEM will make research transparent to the massage community and bust open that big cloud - but we've got to know what we're looking at when that cloud goes away. 

Anyway - that's just one viewpoint - next post will be back to the study of the studies.......

this is a wee short one

post, that is, since one of my favorite flicks is on t.v. at the minute.

Typically, peer review operates at a higher level of analysis than poring over the data with a fine-toothed comb. If the study design is appropriate for addressing the research questions, and they've chosen the correct statistical tests for the situation, and there aren't any other glaring issues, then the peer reviewer tends to spot-check calculations, but doesn't sit down and examine every single point of raw data. 

I can understand that going though the data and re-running the numbers would be maybe too much overall.

I think you've touched on something that we should be aware of from a "knowing the concept" level though and that is that the statistical tests are dependent on the type of data, the size of the sample and the methodology. Is there anything else that it is dependent on? So we might see things like ANOVA and MANOVA (ANOVA stands for analysis of variance), which are similar, but are used to ask slightly different questions: one asks "Is there a difference between 2 or more groups" and the other asks a similar but slightly different question "Is there a difference in 2 or more outcomes among two or more groups?"

So hopefully the peer review will have made sure that the type of statistical testing is correct. 

 

Referring to the study a couple of posts back, you can see that the M (mean) and SD are definitely used a lot and everyone should be familiar with their concepts:

Even though this table does not refer to cortisol levels, you can see the mean and the standard deviation for each of the characteristics of the groups in that study and I hope everyone is comfortable with their meaning now.  You can also see the number in each group (n=) and the last column you have ANOVA and p. 

I'm not going to address that ( "p" ) in this post, but I would like to look at other things in that study.

First off, you'll see that the study is mainly geared towards psychological effects of partners on stress levels.  That is why the third column here is "physicial contact", not "Massage".  Now, even though the physical contact may have included massage, does anyone have any comment on this being included in the top level study (is there some level of training given to the partners beforehand?  What about the description of the technique - is it detailed enough? Would you have included it if you were doing a study on the effect of massage therapy on cortisol levels?  What about the timing in the study?). 

The other thing I'd like to look at is the table that describes changes in cortisol levels in that study:

Some of you might wonder what those bars mean on the chart above and below the dots.  Anyone care on commenting on their meaning or just guessing what they might mean? 

I'll be back on to throw more questions tomorrow, Raven! 

 

 

 

 

charts

Since no one gave this a shot, here is a magazine article that Raven wrote on interpreting them. 

Also, if anyone is interested, there was a little "read this, then test your understanding" on some of the basics of research literacy elsewhere on this site.

Anyway, I might be writing this for nothing since no one seems to be commenting on my comments, but I'll plod on anyway.  It doesn't seem that anyone has an issue with the Ditzen study being included in the Moyer study.  I question whether it should have been included at all, remember that part the title is "Does Massage THERAPY reduce cortisol......"

It makes me wonder just how loose the determination is of  massage therapy in the inclusion criteria.  If you look closer at the study, it is described as "instructed neck and shoulder massage", which will be carried out by a partner.  There is no mention of who did the instruction, the protocol or anything.  And even in the study, it is referred to as "physical contact" in most of it. 

Getting back to the inclusion criteria in the Moyer study, there is very little description of what would constitute as massage therapy.  It looks like if the word "massage" is mentioned at all in a study and it has some mention of contact, then that's all that is required for inclusion.

As an MT, I wouldn't classify what was in this study as massage therapy.

Take a look at the Moyer study again to see what the inclusion criteria is:

 

Along with this:

 

 

The argument could be made that it might meet the inclusion criteria, but my response is that the inclusion criteria doesn't meet what I would view as massage therapy.

Here's a question - am I wasting my time even trying to get people to comment on this (besides Raven?).

 

I may not be commenting, but

I may not be commenting, but I'm still reading and learning!

I think that's worth asking

Here's a question - am I wasting my time even trying to get people to comment on this (besides Raven?).

 

Nobody wants to waste their time, and I don't blame you for asking for feedback. But I would not assume that people are not reading and learning, just because they are not commenting--Adrienne has already mentioned that she is, and I have no doubt that others are as well. It is very common that sites have many people who read silently for every commenter--I've seen figures like 10 readers for each commenter.

People read and learn at their own rates, and the degree to which they wish to chime in will also vary. They're also going to want to get a sense of what the site is, and it's early days yet, so I expect that people will chime in more as they get a better feel for it.

So it's absolutely reasonable to wonder, just as a reality check, if you're wasting your time, but I would say both that

  • you are definitely not wasting your time, because people are watching and learning at their own rate, and
  • you are doing a really good job of touching on several very important issues that are quite relevant to this research--in the study, such as the timing of the assessments, and the larger context, such as data sharing and its effects on the quality of research. 

 

I am very pleased with how you are doing with facilitating this month's Journal Club, and I would say that you are providing food for thought that others will find useful to consider.

Your process is a good one, and I trust that others will join the discussion at the point at which they feel safe and comfortable doing so. And if that joining means reading and absorbing what other commenters say, that counts for learning as well.

 

I'm reading too Rose

I'm actually fairly new to the site (very busy year) and have been poking around, getting familiar and catching up the past two weeks. You're doing a great job Rose and I'll comment when I have something new to add. Thanks for all you do!

 

Cheers!

Thanks, Adrienne, Raven and Robin!

Actually, I'll confess to being in a *bit of a mood* when I wrote that last post.  I'm over it now.  Thanks again for your feedback.

Since this is the first article to be reviewed in the journal club, I should be aware that this is all completely new to most of us, even me.  So getting into the groove will probably take a while! 

As a recapper on some of the points already touched upon, here is a list of what I think we (I'm referring to massage therapists mostly, since I think that is the group that is the most interested in this) should consider.  Also, I'm not sure if everyone is aware of this little list of base questions that can be used as a guide.

  • When we see "research has shown that massage therapy will....." how well defined is *massage therapy*.  If the research involves a group of people that has been shown how to do a couple of strokes for a short period of time, does that count?  How do they differentiate between a trained massage therapist who has had at least some level of training at a massage school and someone that has had 15 minutes of training?  Bear in mind that the "Touch Research Institute" has "Touch" in it, and not "Massage Therapy".  Do we (massage therapists) not see this as an issue and if we don't, why are we bothering to spend a lot of money and time at massage school?

 

  • Uncritical acceptance of any study without digging a deeper is a mistake, especially if it is a top level study that is a "research on research". It requires some level of investigation into the research that is being reviewed. Understanding that there is a difference between just taking the data from multiple studies and doing different comparisons and doing that AFTER doing some analysis on whether the data from those studies is actually backed by good methodology and then only picking the good ones is another.  Data may not be worth much if the methods behind it are highly questionable. The question that the data answers and the methods on how we got that data are very important.  We need to consider this a lot in this type of study.  (Now, in this study, the fact that most of that same data was used in another study that used very weak statistical analysis should be noted.)

 

  • Just because an article has been through peer review doesn't mean that it's perfect.  Just because someone with a lot of letters after their name is the author doesn't mean that it's perfect either. Sometimes top level studies may not have included a well run study because the full data that they needed may not have been made available.  This can have a major impact on findings.  Inclusion/Exclusion criteria is really important in this respect.  It is always a good idea to do searches and investigate why certain studies have been left out.

 

In the next post there will  links to some more of the abstracts or full texts in this study (one thing that everyone will learn very fast is that you will need the full text to get the information you need most of the time- not always an easy thing to get).  I'll also put some links to other articles that I think have important information in them regarding cortisol.

I've got to go back to work here for a bit though! 

- Rosemary

 

 

 

Some links to studies and abstracts

This first post carries either links to abstracts or links to full texts.  I've put all the authors names (to indicate overlap in researchers among groups).  I've also put the full title of the studies.  You'll probably notice that some of them involve reflexology.

You will also notice that some of them are dissertations and so I cannot find them.  I'm not sure how common it is to have dissertation data in studies like this, but for the average consumer who doesn't have access to academic resources it is a little infuriating since we can't even see the abstracts.  To the researchers - is there any way of getting around this? 

I recommend bookmarking the full texts or putting them in some electronic organizer (like Zotero).

Arroyo-Morales, Manuel; Olea, Nicolas; Ruíz, Concepción; Castilo, Juan de Dios Luna del; Martínez, Manuel; Lorenzo, Carmen; Díaz-Rodríguez, Lourdes
http://journals.lww.com/nsca-jscr/Abstract/2009/03000/Massage_After_Exercise_Responses_of_Immunologic.37.aspx
Massage After Exercise-Responses of Immunologic and Endocrine Markers: A Randomized Single-Blind Placebo-Controlled Study

Chin - this is a dissertation, so I was unable to get it.

Beate Ditzen, Inga D. Neumannc, Guy Bodenmannd, Bernadette von Dawanse, Rebecca A. Turnerf, Ulrike Ehlerta, Markus Heinrichse Effects ofdifferent kinds of couple interaction on cortisol and heart rate responses to stress in women
www.psychologie.uni-freiburg.de/abteilungen/psychobio/team/publikationen/Ditzen-PNEC-07.pdf

 
Tiffany Field,Nancy Grizzle, Frank Scafidi, Sonya Abrams, Sarah Richardson, Cynthia Kuhn, Saul Schanberg
Massage theraphy for infants of depressed mothers
www.sciencedirect.com/science/article/pii/S016363839690048X


Tiffany Field, Gail Ironson, Frank Scafidi, Tom Nawrocki, Alex Goncalves, Iris Burman, Jeff Pickens, Nathan Fox, Saul Schanberg and Cynthia Kuhn   Massage Therapy Reduces Anxiety and Enhances Eeg Pattern of Alertness and Math Computations
http://spamovil.com/wp-content/uploads/2011/05/Massage-therapy-reduces-anxiety-and-enhances-EEG-patters-of-1.pdf


Tiffany Field,Maria Hernandez-Reif,Susan Seligmen,Josh Krasnegor,William Sunshine,Rafael Rivas-Chacon,Saul Schanberg,Cynthia Kuhn
Juvenile Rheumatoid Arthritis: Benefits from Massage Theraphy

http://jpepsy.oxfordjournals.org/content/22/5/607.full.pdf+html 
 

Tiffany Field,Miguel Diego,Maria Hernandez-Reif,Osvelia Deeds,Vitillius Holder,Saul Schanberg Cynthia Kuhn
Depressed Pregnant Black Women Have a Greater Incidence of Prematurity and Low Birthweight Outcomes
www.ncbi.nlm.nih.gov/pmc/articles/PMC2652730/


Maria Hernandez-Reif,Tiffany Field, Josh Krasnego,Z. Hossain,Hillary Theakston, I. Burman
High blood pressure and associated symptoms were reduced by massage therapy
www.sciencedirect.com/science/article/pii/S1360859299901298


Maria Hernandez-reif,Tiffany Field,Josh Krasnegor and Hillary Theakston
Lower Back Pain is Reduced and Range of Motion Increased After Massage Therapy
http://informahealthcare.com/doi/abs/10.3109/00207450109149744


Maria Hernandez-Reif,Tiffany Field,Shay Largiea,Christy Cullena,Julia Beutlera,Chris Sandersa,William Weinerb,Dinorah Rodriguez-Batemanb,Lisette Zelayab,Saul Schanberc,Cynthia Kuhn
Parkinson's disease symptoms are differentially affected by massage therapy vs. progressive muscle relaxation: a pilot study
www.sciencedirect.com/science/article/pii/S1360859202902822

 

Maria Hernandez-Reif,Gail Ironsonb,Tiffany Field,Judith Hurley,Galia Katza,Miguel Diegoa,Sharlene Weissd,Mary Ann Fletchere,Saul Schanbergf, Cynthia Kuhnf,Iris Burmang
Breast cancer patients have improved immune and neuroendocrine functions following massage therapy
http://www.sciencedirect.com/science/article/pii/S0022399903005002


Sonya Khilnani,Tiffany Field,Maria Hernandez-Reif,Saul Schanberg
Massage Therapy Improves Mood and Behavior of Students with Attention-Deficit/hyperactivity Disorder
http://cranepsych2.edublogs.org/files/2009/08/massage-ADHD.pdf


Leivadi,S.Hernandez-Rei,M. Field,T. O'Rourke,M. D'Arienzo,S. Lewis,D. Del Pino, N. Schanberg, S. Kuhn, C.
Massage Therapy and Relaxation Effects on University Dance Students
couldn't get this one at all!  can anyone else?

Peter A. Mackeretha, Katie Boothb, Valerie F. Hillierb, Ann-Louise Caressb
Reflexology and progressive muscle relaxation training for people with multiple sclerosis: A crossover trial
www.sciencedirect.com/science/article/pii/S1744388108000650


A.J. Mc Vicar, C.R. Greenwood, F. Fewell, V. D’Arcy, S. Chandrasekharan, L.C. Alldridge
Evaluation of anxiety, salivary cortisol and melatonin secretion following reflexology treatment: A pilot study in healthy individuals
http://www.sciencedirect.com/science/article/pii/S1744388106000909

Menard M.B., 1995. The Effect of Therapeutic Massage on Post
Surgical Outcomes [dissertation]. Charlottesville, VA.  yes, yet another dissertation!

Olney C., 2007. Back Massage: Long Term Effects and Dosage
Determination for Persons with Pre-Hypertension and
Hypertension [dissertation]. University of South Florida,  Tampa, FL  and yet another disseration!

Ann Gill Taylor,Daniel I. Galper,Peyton Taylor,Laurel W. Rice,Willie Andersen,William Irvin,Xin-Qun Wang,Frank E. Harrell Jr
Effects of Adjunctive Swedish Massage and Vibration Therapy on Short-Term Postoperative Outcomes: A Randomized, Controlled Trial
http://www.liebertonline.com/doi/abs/10.1089/107555303321222964
 

In the next post I'll put a link to some studies that I think are important - just wanted to keep it separate.

 

Some articles of interest......

Incidentally, when you're looking at these and you're looking at cortisol, just concentrate on cortisol only - the massage technique, the collections, how they got the data.  You'll notice that there is a lot going on in most of the studies besides just cortisol levels being investigated, and if you try to look at everything it will be information overload for sure. 

This is the study that used % decrease that was referred to eariler in the thread:

CORTISOL DECREASES AND SEROTONIN AND DOPAMINE INCREASE FOLLOWING MASSAGE THERAPY

 http://flagstaffazpilates.com/images/Cortisol.pdf 

 

This is an interesting article by Moraska on cortisol measurement:

http://downloads.hindawi.com/journals/ecam/2010/292069.pdf 

 

And this is an interesting commentary on between group analyses (or lack of):

http://downloads.hindawi.com/journals/ecam/2009/684745.pdf 

 

I know it's a lot to take in, but we can take our time.

One thing to look out for is maybe go into google scholar and try to find studies (prefereable full texts) that have been excluded that involve cortisol and massage and then ask yourself why they were excluded - looking at the inclusion and exclusion criteria.

The next post will be about statistics, which we'll be bouncing back to periodically.

 

Lots of posts and discussion

Hi folks.  Last I was here this discussion was only just getting started, but since then this site has gone public and I see now that there has been quite a lot of activity.  Rosemary asked me if I'd take a look.

There are several different subtopics going on within the thread, so I will likely miss something.  But if you would like to direct a question to me as the lead author of the study being examined, please do so and I'll try to answer it.

Rosemary pointed out a specific study that some may not consider to qualify as "massage therapy."  It's a valid point, and one of the critical steps in a secondary data analysis is determining what your inclusion/exclusion criteria are.  Having said that, it should also be noted that the outcome data are given separately for each study that we included, so it is possible for the reader to see if the results from that one study (Ditzen et al., I think it was) are different from the others.  If one has some statistical experience, they could even go so far as to recalculate the overall results minus that study - all the data necessary to do that should be there.  Sometimes, if the pool of studies being examined is large (say, at least dozens, but maybe even hundreds or, in rare instances, thousands) the review paper will even include different results based on refinements to the inclusion/exclusion criteria.  We didn't do that in the present paper, mainly because we did not see a need for it due to the general uniformity of the results.

a few questions for the lead author

Thanks, Christopher.

Everyone should realize that having the lead author on the JC who is letting it known that he is availalbe to answer questions will probably not happen that much in the future, so you should take advantage of this.  Off the bat, I'm going to start:

- During the peer review process did any of the reviewers question the viability of having the Ditzen study included?

- Also, did any of them question the inclusion of studies that were dissertation data and not studies that are published in peer reviewed journals?

- Also, did any of them question the inclusion of studies that involved reflexology?

- Also, did any of them question that the methods of the studies varied significantly.  Do you happen to know if all of the reviewers had access to those dissertations in order to see the original study?  That one is probably difficult to answer since it is usually unknown - so maybe I'll ask a different one.  Why didn't you specify that you were just going to include data from articles that have been published in peer reviewed journals?

There are other questions that come to mind (which only involved exclusion/inclusion criteria- we're taking small steps here) but I'm hoping that others will chine in and ask questions. ag

Thanks again!

I'll pop on later.

 

Cohen's d introduction

You'll find that they use a lot of letters in statistics, there's the p and the d and the F and the g and the like.  To me this is the equivalent of corporations using acronyms for every bit of information - it's just a matter of knowing what they stand for!

If you recall on the video, there was a depiction of the difference "between groups" - in this case being the difference between the control group (which could involve a group of people lying on the massage table, in the exact same environment, or some sort of relaxation therapy or something - depends on the design) and the group that received the treatment (in this case, massage therapy).
And this was shown graphically as the difference between the purple and green lines in this graph:

 


Now, you will hear a lot about "effect size" in statistics and it's a pretty important to understand the concept. Also, it is important to identify when it is missing and to differentiate the concept of effect size from significance testing.  Think of the main difference as being significance testing being a yes or no answer and we shouldn't really go by that alone (Raven covered this is one of her educational magazine articles). 
When two groups are being compared, effect sizes should be provided, and to do that the researchers use means and standard deviations to do their calculations, both of which have been covered earlier.

One of these measures of effect is Cohen's d and you'll see that the Moyer study uses this in Table 2.

Let's take a look at the stats in the Juvenile Arthritis study again and compare them against what Cohen's d is on Table two:

****************************

ON the Moyer study (Effect sizes):
SD, first   SD, last
0.7          0.55

****************************


ON the Field study:
Cortisol levels   - remember that it's the first number is the mean and the second is the standard deviation.

Day 1
                     Pre         Post
Massage       1.5(0.6)  1.1(0.4)
Relax            1.1(0.4)  1.5(0.7)


Last Day
                     Pre         Post
Massage       1.1(0.5)  0.7(0.6)
Relax            1.3(0.4)  1.1(0.8)


So on the Moyer study, how'd they get that 0.7 and 0.55?

Well, look back at the the graph and see that the mean is compared, right?
So basically how they do it is take the mean of one group, take away (or minus) the mean of the second and then divide it by the population standard deviation (which is basically the mean of both SDs).
And then the effect size is then expressed as a standard deviation. 
Why not just use the units of cortisol?
Because in statistics, comparisons and expressions of data are usually described in standard deviations.  It's their common lingo that expression of measurements are translated into - it's just a matter of getting used to it.


Are you confused yet?
Well, don't worry, since there are little calculators like this all over the web if you want to play with them - just plug in the numbers and you'll see where those numbers came from on table 2 (take into account that the data entry isn't the same as the table format).

OK, so here is something you might want to take note of (what I do is have a wee "rule of thumb list" that I have pinned up on the wall near my computer). 


Big or small effects for Cohens d?
Small   .20
Medium  .50
Large   .80


So you can see that the first session actually looks quite large, right?  But remember that the relaxation group had the mean increase after the first massage?  And also, there was a small sample number.

Hopefully I'll get on here again tomorrow - and I hope the researchers will keep me right if I'm giving out wrong information here. 

Thanks.

-Rosemary

 

Introduction to Confidence Intervals and Forest Plots

I hope everyone understands what we mean by mean and that thinking about it too much doesn't turn us into mean people.
I know, bad joke.

Now that everyone knows what what Cohen's d represents, we can move on to Confidence Intervals.

Figure 1 in the Moyer study is an example of a Forest Plot.  It's use in this case is for comparing results of multiple studies, one of which you will recognize as the Juvenile Arthritis study at the bottom:

 


This is the single dose, first in series and you will see a diamond at 0.7.
Yes, that 0.7 is the between group effect sizes, as expressed in Cohen's d.  So those diamonds are basically the between group effect size for the studies, expressed as d.

So what are the lines to the left and the right of each diamond?
Well, these represent confidence intervals and they are pretty important when assessing multiple studies like this.
In simple terms, a confidence interval is a range around a measurement that displays how precise the measurement is.


You probably aren't aware that you are already subjected to confidence intervals in the news when they report that some poll or other showed that 39% of people liked a certain politician and that 1014 adults were polled and it had a margin of error of 3.5 percentage points. So that means that there is a 95% chance (since that is usually the number used) that between 35.5% and 42.5% (that is plus or minus 3.5) of voters like that politican (I know - it's only an example).


But this basically is along the same line as confidence intervals on the graphs. With a 95% confidence interval, you have a 5% chance of being wrong.

Basically, think of the 0 at the bottom as the null hypothesis (sometimes you might see a 1 instead of a 0, but don't worry about that for now).  So what makes count as being significant in this respect?  Basically if the diamond and the lines are to the right of the 0 and the end of the line (or range) does not touch the 0. (this is a rule of thumb).
You can also tell something about the strength of the study by the length of the lines.  Studies with small samples will have longer lines than ones with high sample numbers.
Anyone want to have a guess at what the small diamond at the very bottom means?

As always, researchers keep me right if I'm wrong.
Thanks.

 

-Rosemary
 

Damn Rose

Rose, you're well on your way to being a pretty good statistics teacher.  Good work.

 

Questions about the review process

- During the peer review process did any of the reviewers question the viability of having the Ditzen study included?

No, not that I recall.

- Also, did any of them question the inclusion of studies that were dissertation data and not studies that are published in peer reviewed journals?

No. 

- Also, did any of them question the inclusion of studies that involved reflexology?

Again, no.  The inclusion/exclusion criteria were not among the reviewers' concerns, if I recall correctly.

- Also, did any of them question that the methods of the studies varied significantly.  Do you happen to know if all of the reviewers had access to those dissertations in order to see the original study?  That one is probably difficult to answer since it is usually unknown - so maybe I'll ask a different one.  Why didn't you specify that you were just going to include data from articles that have been published in peer reviewed journals?

It would be pretty unusual for the reviewers to go to the time and trouble of looking up the original data in a case such as this.  To the uninitiated, this may sound careless on the part of the reviewers, but it really isn't.  The reason it isn't is because our study provides sufficient detail for *any* reader to check the data herself.  For example, are you wondering if the data from dissertations and theses differ from published studies?  Well, we're giving you the data, in number and in figures and in references, so you can check it yourself if you like.  And if someone does that and finds that my colleagues and I blew it, then they can write their own paper that points this out.  Our methods are totally transparent.

The reviewing history of this paper is interesting in other ways, though.  I am definitely happy that it was eventually accepted for publication in JBMT, but that is not the first journal it was submitted to.  I have wanted to get papers accepted in a wide range of journals, and this paper was submitted to Psychosomatic Medicine and to Psychophysiology before it was submitted to JBMT.  I forget which one of thsoe was first, but I'm pretty sure it was PM (I could look it up if you care).  We got a bogus review from PM, in that one of the reviewers criticized the paper but without giving substantive criticisisms.  The review was so unprofessional that I ended up drafting a letter to the editor of that journal.  Since then, I have learned that I must instruct journal editors that certain potential reviewers cannot be relied upon to have scientific integrity.  Sadly, I have other anecdotes of this type.

Psychophysiology sent the paper back without a review, as they concluded the paper was not a good fit for their journal.  At least they didn't waste our time.

-CM

Bad reviewing

An earlier version of the paper submitted to Psychosomatic Medicine recieved three reviews (a typical number).  Two of these were conscientious reviews that raised substantive points, but the third was not conscientious or substantive.  I present it here in its entirety:

The mission of this paper is important. However, it has many fatal flaws as follows:

1)  The logic is flawed if the authors expect as they suggest that anxiety and depression would be reduced by massage therapy but not their biochemical correlate, cortisol. It is not clear what they hypothesize should happen to an underlying biochemistry when  mood states like depression and anxiety shift?;

2) Typically meta-analyses are not possible without variance figures. Most of the studies given in the table do not have S.D.s and the authors claim that they could not get these from the authors of the original papers. That may be that many authors did not have them because of inadequate analyses or did not want to release them to the authors of this meta-analysis who had previously published flawed meta-analyses (one meta-analysis paper by the senior author that was based on 2 studies and another based only on 7 studies, both papers also negating the effects of massage therapy);

3) The methodology is fatally flawed, as minimal criteria were used for the meta-analysis in this paper. With the criteria being so relaxed, many good studies that were randomized and featured control groups were mixed with many bad studies that did not have those minimal features. 20 of the 48 studies were not randomized studies. 16 of the 48 studies had no control group. Several studies had as few as 8 or 9 subjects and 2 studies had 1 or 2 subjects. Different types of massage therapy were used in the different studies.  Values taken from  different types of samples were averaged (e.g. blood and salivas) which is not legitimate. In a word, mixing quality with non-quality studies would be expected to yield a result that is at odds with much of the literature. As stat teachers have been heard to say "garbage in-garbage out."

My response

That review was so unscientific and unprofessional that I felt compelled to let the journal editor know about it.  Here is what I wrote to him.  (Note that the sections which refer to effect sizes without variability data, and studies without control groups, refer to a dataset that we later removed from the final version of the paper - though not in response to this review, but for other reasons.)

December 7th, 2009

David S. Sheps, MD, MSPH
Editor-in-Chief

Psychosomatic Medicine: Journal of Biobehavioral Medicine

Dear Dr. Sheps:

Thank you for your journal's timely review of our manuscript, #PSY09-424, entitled "Cortisol Reductions in Response to Massage Therapy: A Comprehensive Quantitative Review". Logically, we are disappointed that the manuscript was not accepted nor that a revision was requested, but we appreciate that the journal stood by its commitment to review the article in eight weeks, as promised. We are also appreciative of the time and effort that two of the reviewers put in to this review, because we will be able to use their criticisms despite the fact that our study will not be appearing in Psychosomatic Medicine: Journal of Biobehavioral Medicine.

We are writing, however, to express our dissatisfaction with the third of the three reviews. Though it is now water under the bridge, it is unfortunate to consider that the third review may have been the one that tipped the balance from "revise and resubmit" to "rejected." This is because the third reviewer did not actually make substantive criticisms of the study. Though it is now merely an academic exercise to do so, I feel compelled to illustrate this, and even hope you might forward this letter to the third reviewer.

In what follows, the third review appears in its entirety, interspersed with my remarks.

The mission of this paper is important. However, it has many fatal flaws as follows:

1) The logic is flawed if the authors expect as they suggest that anxiety and depression would be reduced by massage therapy but not their biochemical correlate, cortisol. It is not clear what they hypothesize should happen to an underlying biochemistry when mood states like depression and anxiety shift?;

Just because the study does not achieve a result consistent with this reviewer's concepts of anxiety, depression, and the biochemical correlates, it does not follow that the logic is flawed. Anxiety and depression are not perfectly correlated with cortisol or any other biochemical parameter, which is beside the fact that the study makes no attempt to explain precisely how massage reduces anxiety and depression. The goal of the study, as plainly stated, is to comprehensively quantify the relationship between massage therapy and subsequent changes in cortisol levels, which it does.


2) Typically meta-analyses are not possible without variance figures.

We are well aware that meta-analytic results cannot be generated without variability data, and we plainly state that we perform the multiple analyses we have performed are due, in part, to the unavailability of some data.

Most of the studies given in the table do not have S.D.s and the authors claim that they could not get these from the authors of the original papers. That may be that many authors did not have them because of inadequate analyses or did not want to release them to the authors of this meta-analysis who had previously published flawed metaanalyses (one meta-analysis paper by the senior author that was based on 2 studies and another based only on 7 studies, both papers also negating the effects of massage therapy);

The reviewer accuses the first author's other studies of being flawed, but makes to attempt to explain how they are flawed. What type of criticism is that?

The issue of data being unavailable, even when directly requested, has a history in massage therapy research. Often, as the reviewer notes, the data are not available because many of the persons who performed individual studies were not sophisticated researchers. However, it is also true that the laboratory that has done more massage therapy research than any other has claimed, in direct correspondence with me, that much of their data was missing (including some that was not very old) following a relocation of their offices. This was after telling me that they would look for it, and then waiting months to tell me that they never had it. When our meta-analysis was accepted by Psychological Bulletin, the editor of that journal indicated that the set of studies for which variability data was inexplicably missing needed to be mentioned in the text of the article itself. This was done, probably at some embarrassment to that laboratory.

In addition, the reviewer seems to be implying that meta-analytic results based on a small number of studies are somehow completely invalid. This isn't the case, of course. They simply are what they are, a quantitative summary of the best available evidence. If the result is based on a small number of studies and/or has a wide confidence interval, that is a finding in and of itself. In fact, many methodologists have pointed out that, no matter what the number of studies, a meta-analytic treatment is always best.

3) The methodology is fatally flawed, as minimal criteria were used for the meta-analysis in this paper.

This is the old "apples and oranges" criticism of meta-analysis. It's not valid when the category to which we wish to generalize is not apples, nor oranges, but "fruit." We wish to see if massage therapy reduces cortisol. Not only does it appear that it does not, but there is no evidence of variability among the effects (we plainly state the results of having tested for this). If there were differences among different forms of massage therapy, then why is there no variability?

With the criteria being so relaxed, many good studies that were randomized and featured control groups were mixed with many bad studies that did not have those minimal features. 20 of the 48 studies were not randomized studies. 16 of the 48 studies had no control group.

We clearly delineate studies according to those features, which makes this criticism completely irrelevant.

Several studies had as few as 8 or 9 subjects and 2 studies had 1 or 2 subjects.

This criticism is also completely irrelevant, as all the studies are weighted according to their size (either sample size or inverse variance, depending on the analysis).

Different types of massage therapy were used in the different studies.

This is true, as noted above. There are probably no two studies in the massage therapy research literature that have used the same protocol, or even if there are, they would be the exception. This same issue arises in all clinical research.

Values taken from different types of samples were averaged (e.g. blood and salivas) which is not legitimate.

This is certainly not a fatal flaw, and reviewer one even goes so far as to point out that the different methods of cortisol assessment do not differ in their findings. Reviewer one supports that assertion with a citation.

In a word, mixing quality with non-quality studies would be expected to yield a result that is at odds with much of the literature. As stat teachers have been heard to say "garbage in-garbage out."

Our study, more than any study to date, plainly presents the results of massage therapy on cortisol. It does this transparently and quantitatively. The fact that the results contradict some of the literature is the point of the study, not a weakness. Finally, there is no pattern to indicate that our finding is the result of a GIGO situation.

I wish to mention that, prior to submitting this study to your journal, I thought about requesting that it not be reviewed by persons associated with a particular laboratory, as their interactions with me have consistently been unprofessional. I wanted to avoid making such a request, however, in part because I knew that we were able to respond to any reasonable and substantive criticisms of our work, and I was confident that any careless review of our work would be disregarded. I suspect now that I made a mistake in not having made such a request. I suppose I can thank you and your journal for likely having helped me learn a disappointing but important lesson about the subfield of massage therapy research.

In sum, we feel it is very unfortunate that this review was given any weight at all, as it makes no substantive criticisms and even plainly demonstrates the reviewer's poor familiarity with the methods of quantitative review. The third review contrasts starkly with those of the first two reviewers, who plainly took the task seriously and completed it conscientiously.

Sincerely, and on behalf of my coauthors,

Christopher A. Moyer, Ph.D.
Assistant Professor of Psychology
University of Wisconsin-Stout

I agree

Rose, you're well on your way to being a pretty good statistics teacher.  Good work.

 

Very nice work--you've done a great deal of preparation, and it really comes through clearly.

I think one thing people find frustrating about research

is that sometimes scientific consensus changes.

Sometimes it may look arbitrary--eggs are bad for you; no, they're good for you; no, they're bad for you again--but here, we're getting a glimpse behind the curtain into the process.

Well, we're giving you the data, in number and in figures and in references, so you can check it yourself if you like.  And if someone does that and finds that my colleagues and I blew it, then they can write their own paper that points this out.  Our methods are totally transparent.

 

There's no magic and no overriding authority in research. There is only a lot of people's hard work.

And, unfortunately, sometimes people make errors, and then other people come along behind and find those errors.

Chris is saying here that his team's work is subject to exactly the same kind of scrutiny that his team performed on Field's team's work.

If their work is solid, and no one finds methodological flaws in it, eventually, it will become the consensus. If flaws are found, it will be critiqued and redone.

That's one of the strengths of research, but the lack of certainty can be frustrating when it comes time to turn this basic science information into clinical decision-making.

psych question for you, Chris

since you mentioned anxiety as one of the conditions for which the research pretty clearly showed an effect for massage.

Upstream, Robin raised the question about the definitions of stress and anxiety, both as scientists understand the terms, and also how the public perceives the difference between them.

You also mentioned that you liked Selye's conceptualization of stress.

Would you mind summarizing briefly both Selye's conceptualization of stress, and what the current understanding of the difference between stress and anxiety in psychology is?

If you have insight into how that ties into the lay public's understanding, that would be useful, too.

Transparency

Transparency is a good thing - no doubt.

Also, when there is a self-fixing mechanism in place, whereby one researcher will take the data, handle it differently and basically throw a light on the fact that there was a major flaw on how the data was analyzed by another researcher is also a good thing.

What isn't so cool is that there are studies where the data isn't available - for example, the Lawler/Cameron study didn't have the control group data, so it couldn't be included in this analysis.  Also, there were studies that had missing data from not providing the SDs for groups, so basically, the lack of transparency in the research upon this research is being done is an issue.

Moreover, there is the question of methods.  I know there is an emphasis on "look at the data and handle it right" here (which is good), but before we even get the data, we need to ask ourselves "What exactly was the question in the original study and how good is the methodology to answer that question?"  If either of those is off, the data isn't worth diddly squat not matter how it's handled.  This is from the perspective of a massage therapist.

As an example of where I'm coming from:

Say there was a study and as part of the methods a family member is trained for about 20 minutes on sitting in a chair, asking a few questions on how someone feels, their thoughts and behavior and taking some notes and this is put forth as "The effects of Psychotherapy on......".

Well, you can imagine how that would go down.  If you replace that with any sort of therapy e.g. physiotherapy or occupational therapy, then you can imagine how people in those professions would react.  It seems to me that a quite a number of researchers regard massage therapy as nothing more than a bit of rubbing that can be taught to someone in a few minutes.  I could take that as an insult, but I won't, because I'm a massage therapist and I'm pretty laid back wink.  There was another post on POEM regarding the "curation of massage therapy".  I think it's time that there was some curation of the use of the term, otherwise we're nothing more than people that give others a wee shoulder rub (I'm also aware that some therapists have t-shrits with "I'm a rubber" on it - yep.  not me though).  I'm not sure about anyone else, but when someone approaches me outside of the work environment and said "can you just give my neck a rub, since you do massage" it sort of irks me.  I politely give them my card and ask them to make an appointment.  There's a reason for that.

 

Also, I sometimes wonder if the fact that we're not taught research literacy skills has lead to poor quality studies.  Take this abstract, for example (one of the studies used).  I don't know about anyone else, but not even having the correct spelling for Therapy, never mind "stress" doesn't scream "high quality" to me.  And if you look at it a bit more, you'll see that it's not exactly telling us very much, it's badly formatted and oh, look at the bottom of that to see where it got it's funds. 

Thanks for answering my questions, Christopher.  I'm also glad that you're being very frank about the feedback you got and the peer review process.  I think others will find that interesting too.

By the way, people should be aware that peer reviewers are not always people that have a lot of experience in reading statistics.  Bear this in mind when assessing how good a  journal is.  Personally, if I see that a journal has missed something big when it comes to statistical handling, I won't bother looking at anything in that journal again.  But that's just me.

Anyway, I'll pop on again later if I've time.

 

 

 

You've put your finger on it

Moreover, there is the question of methods.  I know there is an emphasis on "look at the data and handle it right" here (which is good), but before we even get the data, we need to ask ourselves "What exactly was the question in the original study and how good is the methodology to answer that question?"  If either of those is off, the data isn't worth diddly squat not matter how it's handled.  This is from the perspective of a massage therapist.

 

Precisely. This is the question which is the crux of all the analysis we are doing of research articles.

All the other questions, such as "did they use the right statistics? did they randomize properly?" and so forth are all refinements of your question, which is at the heart of any study.

There are several different things going on here

 

Say there was a study and as part of the methods a family member is trained for about 20 minutes on sitting in a chair, asking a few questions on how someone feels, their thoughts and behavior and taking some notes and this is put forth as "The effects of Psychotherapy on......".

Well, you can imagine how that would go down.  If you replace that with any sort of therapy e.g. physiotherapy or occupational therapy, then you can imagine how people in those professions would react.  It seems to me that a quite a number of researchers regard massage therapy as nothing more than a bit of rubbing that can be taught to someone in a few minutes.  I could take that as an insult, but I won't, because I'm a massage therapist and I'm pretty laid back wink

 

Yes, that's quite true. On the other hand, we deal in touch, and touch is something that goes way further back than just human history. Many different kinds of animals use touch in ways that aren't simple and obvious, that we have to explore through neurology, endocrinology, and other disciplines in order to understand.

So just touching and rubbing can have beneficial effects. And we use touching and rubbing for just those reasons. And yet, as you point out, what we do is so much more than just that. Which brings up several issues that affect how we apply this research, as well as how we communicate with clients and other healthcare practitioners.

We have not yet talked about the several different types of validity involved in studies, but one of the issues that validity has to do with is "how much can the effects of this study be expected to apply in different populations and treatment situations?".  So does having a family member perform a rigid protocol for 20 minutes every day really tell us anything useful about what we do in the clinic?

In addition to questions about validity, it also brings up questions about how we communicate to other stakeholders about what it is that we really do. Do we have enough consensus on what that is that we can do so reliably, or is that something that we need to agree on in-house first?

 

There was another post on POEM regarding the "curation of massage therapy".  I think it's time that there was some curation of the use of the term, otherwise we're nothing more than people that give others a wee shoulder rub (I'm also aware that some therapists have t-shrits with "I'm a rubber" on it - yep.  not me though).  I'm not sure about anyone else, but when someone approaches me outside of the work environment and said "can you just give my neck a rub, since you do massage" it sort of irks me.  I politely give them my card and ask them to make an appointment.  There's a reason for that.

 

It sounds like you are clear on your professional boundaries. I think being clear on that is a good thing--not that everyone draws them in the same places as everyone else, but knowing what's right for you and sticking to it is what I think is good.

Touch is good....

....but you don't have to be trained in it and anyone can do it.

If most of the studies changed "massage therapy" to "touch" and referred to it as "touch", I would have no problem with that.

There is a serious lack in the definition of the term and it can cover anything - and that is partially got to do with the state of the industry - the lack of standards in education in it, the fact that we do so much *stuff*, the political side of it all etc.  The Moyer study included reflexology studies.  I don't count that as massage therapy and would suspect that most MTs that view themselves as evidence-based wouldn't count it as massage therapy either.  I also have issue with studies that have data from dissertations.  That's because of the level of bias in dissertations (which I think would be higher than published articles) and without putting a lot of effort put into getting getting them, none other than academics can easily get access to them to see how good the methods are in them.  But that's just me.  So if I were to throw out all the studies that were dissertations, that didn't count as *massage therapy* to me and whose methods seems lacking, then I'm not left with much.   So as a consumer of research from an MT perspective and as a stakeholder, I appreciate the fact that this study highlighted that the data was handled wrong and that the cortisol claim shouldn't have been in our textbooks in the first place, but what else can I get from this?  It's a learning experience, yes.  And that's good.  Has it made me more cynical?  Probably.  Has it made me think that for the most part, the peer review process is broken in a lot of journals?  Yes.  Has it made me think that massage therapists must not be involved in research?  Yes.  Has it made me think that every scientist involved in MT research should be forced to sit a statistics exam before they can get money to do anything?  Yes.

But I'm not throwing out the baby with the bath water - just yet! smiley

And this is just me........for everyone else who is a prefessional toucher, I'm sure they've got a lot out of it!

-Rosemary

Yes, the process clearly broke down

The question we're not sure about is exactly where

I don't know about anyone else, but not even having the correct spelling for Therapy, never mind "stress" doesn't scream "high quality" to me.

 

Yes, that's embarrassing. But I have no doubt that they all know how to spell those words; this is not an issue of their not knowing.

It may have been that they were in such a rush for whatever reason that their process totally broke down. Having turned in both papers and grants on tight deadlines, I can envision such as scenario, although those errors are both so elementary and so flagrant that I have a hard time seeing how they got past even the most frantic deadline rush.

I have a different idea of what may have happened. Remember, this is a 1996 paper. Not everything was so dependably online 15 years ago as it now is.

At that time, a lot of things existed in print only.

I can imagine a scenario where this is a print article that the publisher later decided to scan in, and that the publisher end was sloppy about looking for errors introduced into the text by optical character recognition (OCR) programs.

I will be over at the UW on Friday, and I will see if the article is available there (for me at home, it is behind a paywall). If it is there, I will see whether the article really contains those errors, or if it's only the record pointing to the article that does.

Not that I disagree with your general point that if authors are sloppy about writing, editing, and proofing, that may be a symptom of sloppy labwork as well. But this is so embarrassingly bad, that I actually find it hard to believe it made it past even the most harried team on deadline.

 

And if you look at it a bit more, you'll see that it's not exactly telling us very much, it's badly formatted and oh, look at the bottom of that to see where it got it's funds. 

 

It's true that when you get corporate money, you have to be scrupulously careful to avoid even the perception of bias, and that corporations often don't have the institutional safeguards against bias that institutions like the National Institutes of Health (NIH) and the National Science Foundation (NSF) have.

But when only 5% or 10% or 20% of NIH and NSF grant applications receive funding, it's also clear that corporate funding of research is not going to go away any time soon, either.

It's a fact of research life, and it's something we need to be aware of when evaluating research. It's one of the reasons why researchers disclose their funding sources in articles--to give the reader the opportunity to make the determination of how much weight to assign that possibility.

Thanks for checking that

I wondered if it was just a typo with data entry - but even at that, you'd have thought someone somewhere would have corrected it - especially since it's glaring in the title.  But even if it is typos and that was corrected - would you regard it as a well written abstract?

Also, I hope I'm not coming across as a big negative naysayer.  I'm a big believer in research as a lever to raise the industry up.  And I also believe that a mix of researchers is a really good thing. 

Anyway - I'll get back to the study again here.  I think that's probably best - otherwise we're just off in opinion-land.

Percentage change

OK, so way back at the start of the thread I put a reference to a study that used % change. 

And this is from that study:

Hopefully, everyone understands why this can be misleading to a reader who is not familiar with the reasoning behind a control group and why % change is inadequate. 

Use of % change and % difference and when they make sense and when they don't is pretty important.  If anyone has a questions on that, please ask it - and make use of the fact that the lead author in this study is a *data guy*!

Also, I'm not sure if anyone had a look at the Moraska study, but I recommend people look at it to see what is said in it about cortisol measurement. 

Has anyone looked at some of the full texts that I gave in an earlier post? 

 

-Rosemary

Stress and anxiety, stress vs. anxiety

Selye's definition of stress, at first, seems too simple.  He defined stress as the body's response to *any* change in the environment.  It logically follows that the absence of stress is death, since dead things do not actively respond to their environment.

But Selye's insight is brilliant if you think about it.  Whenever the body has to adapt, this introduces wear and tear, even if the event that requires adaptation is a positive event.  For example, if you get a terrific new job, this is positive but it is also a stressor - maybe you have to move, you have to tell your old job you're leaving, you might have to make a new budget, etc.  And while it is true that negative events might lead to more complications than positive events, it is a useful concept to consider that stress is when the body has to adapt.  This is the best conceptualizaton of stress that I have encountered.

Now, "stress" is also a word that folks use in their everyday lives, almost always to indicate something that is negative, and this is where confusion can arise.  Many folks, including some scientists, use the word "stress" in an imprecise way.  If you see a scholarly or scientific article that refers to stress, slow down and ask yourself what the researchers mean by stress.  Have they indicated it clearly?  If not, it is likely to lead to confusion, in my experience.

While most people *think* they know what stress is even though they do not think it through in detail, I find many people are skeptical that anxiety is a well-defined concept, even though it is.  Anxiety is a mood state that is similar to fear, but it differs from fear in that it is future-oriented.  Where fear might equate to "(Bad event X) is about to happen to me," anxiety equates to "(Bad event X) might happen to me eventually."  Even though this - anxiety - is a mental event that cannot be observed directly, we can (and have) come up with valid and reliable ways to assess whether an anxiety state is occurring in a person or animal.  If certain behaviors (including answers or survey responses) are elicited in the presence of a possible aversive event, it is reasonable to infer that a mental state, which we call "anxiety," is being assessed.

This is a simplification, of course.  It raises questions about validity, reliability, materialism vs. dualism, and many others interesting controversies.  If this interests you greatly and you are willing to put the rest of your life on hold, you might want to consider going to graduate school for psychology.  If you're not willing to do that, then maybe you'll be satisfied by reading the manual of the State-Trait Anxiety Inventory.  If you're not even willing to do that, then you might have to take my word on it that anxiety is a reasonably well-validated scientific concept that can be meaningfully measured in both humans and in some animals.

-CM

analog studies, etc.

As an example of where I'm coming from:

Say there was a study and as part of the methods a family member is trained for about 20 minutes on sitting in a chair, asking a few questions on how someone feels, their thoughts and behavior and taking some notes and this is put forth as "The effects of Psychotherapy on......".

This is an excellent point.  And yet, studies such as this are sometimes conducted.  They are sometimes known as "analog" studies, because the intervention being used is considered analogous, but not idential, to what really should be studied.

I have never done an analog study, and I try to avoid them.  But, I understand why they are conducted, and I might do one myself someday.  Essentially, they are a 'weak' test of a treatment.  If talking to someone improves their psychological condition, it does logically follow that psychotherapy - in which a highly trained person talks with someone - should work at least as well as having an untrained person.  But if it doesn't work, it is still entirely possible that psychotherapy works, because the specialized aspects of psychotherapy haven't been tested in an analog study.  The same logic applies to massage research.  If *any* touch intended to be therapeutic is helpful, it is reasonably to conclude that true massage therapy will be helpful, but the reverse - if untrained touch is ineffective, legitimate massage therapy training will also not be effective - is not necessarily true.

True clinical research can be very difficult to conduct.  It requires money, resources, and the availability of experts, and that's just for starters.  When any one of those things aren't available, the next best thing might be to do an analog study.  As I stated previously, I have avoided doing analog studies myself, at least up until the present date, because they have important limitations.  However, I do understand why researchers sometimes conduct them, and I wouldn't entirely rule out, ahead of time, the possibility of ever conducting one.

-CM

Accuracy, typos, and such

Concerning data entry and typos and reliability - in graduate school we read a paper about survey research, and I think the author was Norbert Schwartz but I could be wrong about that - and the point was made that data can never be 100% accurate.  If you ask people to indicate their own gender, the reliability of that data has been observed to be "only"about r = .96.  In other words, even if the thing you are asking about is something people should know about themselves with practically no ambiguity, there will still be keystroke errors, misunderstandings, and the like.  No data, not even the most basic, can be gathered with perfect reliability.

-CM

Reliability of data gathering

Important note

UPDATE, 15 September 2011: I'm not one for deleting posts or comments without very good reason, because I think accountability demands you stand behind what you say, rather than just changing the record. However, after I had posted this comment, Chris pointed out a couple of ways in which it was in error.

So don't follow the advice in this comment; from the feedback, I learned that I was thinking about it wrong, by casting it in terms of percentages.

Here's a better explanation that you should use instead of this one.


data can never be 100% accurate.  If you ask people to indicate their own gender, the reliability of that data has been observed to be "only"about r = .96.  In other words, even if the thing you are asking about is something people should know about themselves with practically no ambiguity, there will still be keystroke errors, misunderstandings, and the like.  No data, not even the most basic, can be gathered with perfect reliability.

 

That makes sense. Information in the real world always contains a certain amount of noise.

Understanding statistics is not a requirement for participating in Journal Club, so I'm not assuming that everyone here is familiar with the statistic r (Pearson's r).

r is a measure of how reliably two things correlate with each other--in this case, the person's gender in real life, and the gender that they indicate in their answer on a questionnaire.

You would expect a perfect correlation--100% reliability--but, as Chris points out, it's been observed that

r = .96

 

There are several statistical measures that can range in value from 0 to 1*, and which can easily be converted to percentages. We'll talk about those statistics as we encounter them.

r is one of those statistics, so you can multiply it by 100% to find out what that means in percentage terms.

.96 x 100% = (.96 x 100)% = 96% 

 

So r = .96 means a reliability of 96%, and an error rate of 4%.

If something that relatively straightforward has an error rate of 4%, it would be reasonable to expect the error rate to go even higher when the questions get more complicated, such as medical history.

 

* r actually ranges from -1 through 0 to 1; we will talk about what the negative values mean later, and just treat the 0 to 1 part of r's range as 0% to 100%.

Thank you for the clarification

They are sometimes known as "analog" studies, because the intervention being used is considered analogous, but not idential, to what really should be studied.

 

I had never heard the term "analog studies". 

Which, when you think about it, is really odd, since so many of the protocols in MT research would fall under that term.

Just never came across it, and I don't remember any studies where they explicitly called it an analog study. In fact, they do say they are studying "massage", which gets back to the point about definitions.

my opinion of the 1995 abstract

But even if it is typos and that was corrected - would you regard it as a well written abstract?

 

With the caveat that I have 15 years of hindsight to draw upon, so it's a little unfair to judge a 1995 article by 2011 standards, I would still have to say I don't think it's especially well-written. There's just not enough information about methods there. It's very general and high-level:

Forty full-term 1- to 3-month-old infants born to depressed adolescent mothers who were low socioeconomic status (SES) and single parents were givn 15 min of either massage (n = 20) or rocking for 2 days per week for a 6-week period. The infants who experienced massage therapy compared to infants in the rocking control group spent more time in active alert and active awake states, cried less, and had lower salivary cotisol levels, suggesting lower stree. After the massage versus the rocking sessions, the infants spent less time in an active awake state, suggesting that massage may be more effective than rocking for inducing sleep. Over the 6-week period, the massage-therapy infants gained more weight, showed greater improvement on emotionality, sociability, amd soothability temperament dimensions and had greater decreases in urinary stress catecholamines/hormones (norepinephrine, epinephrine, cortisol).

 

One thing I prefer is a structured abstract. Here's an example of a structured abstract for a study that included massage:

 

Judson PL, Dickson EL, Argenta PA, Xiong Y, Geller MA, Carson LF, Ghebre R, Jonson AL, Downs LS Jr. A prospective, randomized trial of integrative medicine for women with ovarian cancer. Gynecologic Oncology. 2011 Aug 22.
 
Abstract
OBJECTIVES:
Despite increased use of integrative medicine in cancer therapy, little data exist on its efficacy. This prospective, randomized, pilot trial sought to evaluate the feasibility of combined modality integrative medicine (CM-IM) in women with ovarian cancer (OvCA) and evaluate its effects on quality of life (QoL), chemotherapy toxicity and immunologic profiles.
 
METHODS:
Women with newly diagnosed OvCA requiring chemotherapy were offered enrollment. Those randomized to the experimental arm received hypnosis, therapeutic massage and healing touch with each cycle of chemotherapy. The control arm received chemotherapy without CM-IM. All patients completed QoL questionnaires prior to cycles 1, 3 and 6, and 6-months after chemotherapy. Immunologic profiles were measured. Statistical analysis was based on intent-to-treat. Student's t-test and Fischer's exact-test were used to determine differences.
 
RESULTS:
Forty-three women enrolled. All women randomized to CM-IM were successfully treated. There were no statistical differences between the groups in age, stage, grade, histologic cell type, CA125 levels, or surgical cytoreductive status. There was no difference in overall QoL measurements. Re-hospitalization rates, treatment delays, anti-emetic use, and infection rates were similar. Immunologic profiles revealed no difference between arms for WBC or salivary IgA levels. Women receiving CM-IM had consistently higher levels of CD4, CD8 and NK cells, although this did not reach statistical significance.
 
CONCLUSIONS:
Prospective clinical evaluation of integrative medicine for women with gynecologic malignancy is feasible. This first, pilot study of CM-IM in gynecologic oncology demonstrated no improvement in QoL or chemotherapy toxicity. Integrative medicine-associated improvements in immunologic profiles warrant further investigation.
 
(Note that I haven't evaluated the article yet. I am only using this as an example of a structured abstract with headings that correspond to the different parts of the entire article.)
 
That's the kind of structure and detail I like to see in an abstract. It's not always practical, but when I'm using an abstract to decide whether getting the entire article will be useful, there's enough detail to make a reasonable decision about whether the study did what it said it set out to do.

Not exactly

So far your presentation of statistical information has been terrific, but with Pearson's r you don't have things quite right.  You suggest that the statistic is called r because that stands for reliability, but that's not the case, and Pearson's r can be used in many ways other than measuing reliability - it's simply a standardized way of quantifying the correlation between any two variables.  (There is a reason it's called r and I used to know it, but I've since forgotten.)

Also, you cannot treat r as a percentage, as you suggest.  In some cases it is appropriate to square an r value to get a new statistic that is the percentage of variance accounted for by a statistical model, but r itself is not a percentage and cannot be interpreted that way.

Rose, here is an extremely nerdy little tool that one can use to get a feel for what r values represent.

http://www.rossmanchance.com/applets/guesscorrelation/GuessCorrelation.html

-CM

analog studies, etc.

I had never heard the term "analog studies". 

Which, when you think about it, is really odd, since so many of the protocols in MT research would fall under that term.

I learned the term as part of psychotherapy research.  Its usage might be limited to that arena, but I agree that it could be extended to other research areas such as massage.

-CM

Abstract types

As someone who reads in and publishes across fields, I've noticed that the traditions for abstracts (among other things) varies.  The abstacts with headings seems to predominate in medicine and related fields, while the single paragraph, no-heading style seems to occur more often in psychology and related fields.  Here is an example of the latter from one of the premier journals in psychology:

http://pss.sagepub.com/content/early/2011/08/18/0956797611419302.abstract

Either format can be effective, but good writing is always necessary for a good abstract.  And since the abstract is the only part of the paper most people will ever read, it's worth writing it well.

Oops!

So far your presentation of statistical information has been terrific, but with Pearson's r you don't have things quite right.  You suggest that the statistic is called r because that stands for reliability, but that's not the case, and Pearson's r can be used in many ways other than measuing reliability - it's simply a standardized way of quantifying the correlation between any two variables.  (There is a reason it's called r and I used to know it, but I've since forgotten.)

Also, you cannot treat r as a percentage, as you suggest.  In some cases it is appropriate to square an r value to get a new statistic that is the percentage of variance accounted for by a statistical model, but r itself is not a percentage and cannot be interpreted that way.

 

Thanks for the correction; I've apparently been using r in ways that I shouldn't. I appreciate your pointing out my error, so that I can change that.

I'm not one for deleting posts or comments without very good reason, because I think accountability demands you stand behind what you say, rather than just changing the record. I will, however, put a prominent note at the top of the comment.

Cheers, Chris!

 

Let's try interpreting r again

UPDATE, 10:53 AM PT: Chris noted some improvements that this explanation needed, so I've edited the comment below to incorporate them.


As I mentioned earlier, knowledge of statistics is not a prerequisite for participating in Journal Club.

We'll deal with what particular statistics mean as they come up, but if you see that someone else already has, and is using, statistical knowledge, don't let that discourage you from commenting.

The statistic r came up in the discussion. When Chris says

r = .96

 

all by itself, if you are not familiar with what that statistic means, then the meaning of that number is not immediately obvious.

As Chris explains above, r is a standardized way of quantifying the correlation between two variables.


A variable is anything that can change, or vary. In the example of people reporting their gender in answer to a questionnaire, there are two variables:

  • The person's actual gender can vary in real life between male, female, and a number of other possibilities, including intergender, transgender, and others.
    I am assuming that, in this case, since the researchers wanted to look at reporting errors, they already adjusted the data so that they were looking only at straightforward cases where there was no possibility that the person reporting would not find the option they were looking for on the form--after all, that would not be the reporting person's error. So I expect that in this study, the people were selected so that the person's actual gender varied only between clearly male and clearly female.
  • The person's reported gender on the questionnaire can vary as it does in real life, and again, I expect that for the sake of clarity on what is a reporting error, the choices were only male and female.

Correlation means that two or more things co-occur (occur together) or co-vary (vary in a way that is related to each other). For example, in a positive correlation, if one increases, then the other one increases, and vice versa. An example of a positive correlation is age and height in normally-developing children. As these children get older, they also get taller.

A negative correlation is a situation where two things vary in connection with each other, but in opposite ways--as one variable increases, the other decreases. For example, the older you are (the more years you already have), the fewer years you have remaining to live. So age and the remaining years to live are negatively correlated.

Correlation can be due to causation, but it doesn't have to be: that assumption that they are the same thing is what the saying "Correlation is not causation" warns against. So when we talk about the relative correlation of things in terms of r, we are just talking about their co-occurence or co-variance--we are not saying anything here about actual cause and effect.


Quantifying means assigning a number to something, and the power of assigning numbers to something is that--if the numbers are assigned in a rigorous and standard way--you can use those numbers for comparison to each other.


So by quantifying the correlation between two variables in a standard way, we can begin to talk about the relative strength of that correlation. In young children, for example, height and age co-vary strongly, while the correlation becomes weaker in older children, as they reach the limits of their growth.

 

r ranges from -1 to 0 to 1, in the following way:

 

So when Chris says r = .96, he means that it's almost perfectly correlated, but not quite, for a correlation you would expect to be 1, or perfect (that people's reported gender would perfectly track their actual gender).

Umm..... I did not cover r, but thanks.

Rose, here is an extremely nerdy little tool that one can use to get a feel for what r values represent.

It wasn't me that mentioned r, but thanks for the link.

-Rosemary

It wasn't me that mentioned

It wasn't me that mentioned r, but thanks for the link.

Oops, my mistake.  I see now that was Raven's post.

Maybe you and Raven are melding into one virtual person on this site, the way you and I did over on the [other] Site.  :)

-CM

(UPDATE, 10:09 AM PT: I've changed one word in Chris' post, as I've made a promise to all of you to enforce the POEM policies equally across the board, and this one change brings it into alignment with those policies.--RST)

I'm going to be picky

The updated info on correlation is better, but there are still some things I must take issue with.  Your wording of "direct" and "indirect" is imprecise and misleading; what you mean to say in those cases are "positive" and "negative."  Positive correlations are when two variables are moving in the same direction; as one increases, so does the other.  Negative correlations are when two variables move in opposite directions; as one increases, the other decreases.  There is nothing about a negative correlation that is any more or less direct than a positive correlation.  Further, if the scale we are using is arbitrary, we can always reverse one of the scales which will have the effect of changing the sign of a corresponding correlation coefficient.

As noted before, correlations are bounded between -1 (a perfect negative correlation) and 1 (a perfect positive correlation).  A value of zero indicates no correlation at all.

Your example of height and weight in developing children is an excellent example.  As one increase, so does the other tend to increase, but as we all know those two variables are not perfectly correlated.  Sometimes people get taller without gaining weight, and sometimes they gain weight without getting taller.

Your next example, in my opinion, is not a good example, even if we replace the word "indirect" with "negative."

A negative correlation is a situation where two things vary in connection with each other, but in opposite ways--as one variable increases, the other decreases. For example, the faster (more speed) you drive to your destination, the less time it takes you to get there. So speed and the time the drive takes are indirectly correlated.

The reason I do not like this example is because the two variables - rate, and time - are perfectly correlated; if we know the rate at which something is travelling across a given distance, we can perfectly predict travel time.  And when two variables are perfectly correlated, they are not really two variables at all, but are only a single concept. 

To give another example of this, degrees F and degrees C are perfectly positively correlated, because they are actually the same thing (temperature) just on a different scale.  For a negative correlation example, if we know how much individuals weigh in pounds, this will be perfectly and negatively correlated with how much less they weigh, in grams, than the Empire State Building.  Both scales are the exact same thing, conceptually (weight).

A better example of a negative correlation would be a person's current age, and their remaining years of life.  The older we get, the fewer years we have left to live, but we cannot perfectly predict one from the other.

-CM

I appreciate your being picky; that's how we get clarity

I'm glad the updated version is closer; I will edit it to bring it into alignment with the points you made--they are well-taken, and the examples will serve much better for teaching when they are clarified.

I hope that in addition to the details of r being discussed here, that people are seeing what the principle of "no blame, no shame" means in practice.

All of us, no matter how hard we've worked, have areas where we can continue to learn, and there is no shame in not already knowing everything perfectly.

We've come to this place via different paths, and we have complementary strengths where we can help each other out.

Don't be afraid of making errors in your learning process; we're all on the same side here, and we'll help each other figure things out.

 

In principle, I agree

Also, I sometimes wonder if the fact that we're not taught research literacy skills has lead to poor quality studies.  Take this abstract, for example (one of the studies used).  I don't know about anyone else, but not even having the correct spelling for Therapy, never mind "stress" doesn't scream "high quality" to me.

 

It's quite reasonable and a very good guiding principle to ask "if they can't get the little things right, how can we trust them to get the big things right?".

If you see elementary errors being made, then you have to wonder what's going on with the team and this research that led to those errors making it all the way through the process into print.

However, these misspellings were so bad that they were over the top. I find it hard to believe that--even if the team were so harried and rushed by deadlines that this slipped through their process, that reviewers/editors at the journal would let something *that* egregious into print.

I count:

  • theraphy for therapy twice
  • givn for given
  • cotisol for cortisol
  • stree for stress
  • earlt intervention for early intervention
  • senssory stimulation for sensory stimulation
  • P.O. Bos for P.O. Box

just in the abstract.

That's a level of carelessness I find pretty unbelievable, and so I suspect the explanation lies elsewhere than the authors of the paper.

I got the paper from the library this morning, and here's a screenshot of the abstract:

 

The mistakes aren't in the paper; the publisher introduced them into the record pointing to the paper. I suspect it happened when someone used a scanner and an OCR program to make their record, but did not bother to proof it after the OCR program had run.

Comments on the Moraska paper....

I'm just going to throw this on here to highlight some things on the Moraska paper:

From that paper:

Sorry for the poor quality.  This was also on it (and it's probably better if you look at the page on the original paper)

 

And also this:

So what is going on here?

Well, here are some things that I thought were interesting :

This paper took studies that had been published and re-listed information. 

  • There was really not much criticism of the original studies and it failed to highlight the fact that between group analyses were not carried out in the original studies.
  • To me, the study *could* be misleading.  A glaring example of that is there are two codes for the p value at the bottom of the main table (p<0.05 and and other one for p<0.01).  Remember that the p value is only yes or no statistic - it's just a cut off point.  It is not a measure of effect.
  • Another thing that stood out is that they mention a Moyer study (a meta-analysis) and that their suggestion was that the "inferential meaning" was because they only looked at studies that had trained therapists and they did it later, so included more studies.  I don't think that really hits the nail on the head.  What is missing is the fact that even though most of the studies here were RCTs, only within group analyses were made.
  • We (massage therapists) should realize that there is a big difference between "a review of the literature" and digging into the data. 


There are other things that I could say about that paper, but I'm hoping others will chime in....yes, I live in hope!

On another note, this is a really good synopsis of biases in research that everyone should be aware of.

- Rosemary

 

 

The bad spelleng in th abstrict

I'm glad it wasn't in the main paper. 

I'm going to throw something else out there as a way of trying to get someone to comment on at some of the other studies!

Since the Moraska paper used trained therapist and no dissertations in the inclusion criteria, but it did include studies that had pre and post data, can you see how easily is it to identify research that made it into one search using one type of inclusion criteria, versus the other (in the Moyer study), and does everyone understand why there are ones listed on the Moraska study that didn't qualify to make it into the Moyer study and vice versa?

Can you also see that this can be an aid in some way to a therapist who wants to try to figure at least some information out for him/herself? 

OR do you see it as just a bunch of confusing information? 

-Rosemary

Studies common to both

Seven studies are common to both.  So this means that seven have enough data for control group comparions, had trained MTs involved and weren't dissertations.  Now, I'm not saying that every therapist would view this the same - this is just me choosing which studies I think make sense to me, as an MT.

Out of those 7 there was a major overlap in the names researchers involved.  Does anyone see that as an issue?

Only three of those studies used saliva testing.

This is from one of those studies:

So even though no one commented on the 30 minutes beforehand in the Juvenile Arthritis study, you can see that the timing is different in this one, and it makes more sense to me.

But if you look at the study further, you'll see that there was no mention of what they did before the massage.   Taking that measurement right before the start of the massage is a reflection of state 20 minutes prior to that time.

Also, it was a 15 minute chair massage that was administered in this study.  Now, I'm not saying there is anything wrong with that, and I do chair massage sometimes in corporations, but would most therapists not view it differently from table massage, where it is skin on skin?  Also, with chair massage there is usually very little (if any) interview process, information gathering or anything else - so although it might give me some information, it's application to my every day work is somewhat limited.

If you look at this study on the forest plot on the Moyer study, you'll see that the effect size does not have statistical significance - no big surprise there. 

So that is one of the 7.  As for the other 6?  Well, since I don't have the full text right now, I cannot look at them in any detail.  What I can do is try to find out the validity and level of exactness in using urine analysis. 

 

-Rosemary

 

hmmmm.....

well, it looks like I'm writing to myself here.....

So I'll not bother putting anything more in here on this study, unless someone has a question?

It may be that there aren't enough people on the site yet or it could be that people just aren't interested.

Here's the thing though - having the main author available for questions probably won't happen that often.  If anyone has a question, you should ask it. But if everyone already knows what MD is on the Moyer study on the table 2, then we're OK, right?  Everyone knows why the urine stat testing was different from the saliva one, right?

Or it might be that nobody cares. 

Anyway,  it was a cool experience and I hope someone somewhere got something out of it! 

I hope next months discussion gets more interest.

ciao!

-Rosemary

 

 

 

I don't think it's true at all that nobody cares

 

well, it looks like I'm writing to myself here.....

 

No, I would not agree. I have been reading what you write, and I am very pleased with how much you have done, and the ways in which you are thinking.

 

So I'll not bother putting anything more in here on this study, unless someone has a question?

 

You have already done a lot of work. If you don't put anything else in, you've still done way more than I expected.

You've done a great job inaugurating Journal Club, and you've impressed me with all the work you did.

 

It may be that there aren't enough people on the site yet or it could be that people just aren't interested.

 

I think it's early days, and people are still trying to figure out what the site is, and what's going to be here.

And I also think that people get here from many different backgrounds, and they may think that skills in math or endocrinology are absolutely necessary, and that they'll be judged if they don't already have those skills.

But from people's responses, the very last thing I would think is going on is lack of interest.

 

Here's the thing though - having the main author available for questions probably won't happen that often.  If anyone has a question, you should ask it.

 

True. It's a very good opportunity.

On the other hand, what he works on is rather nuanced. It might be that we need to lay a little more groundwork first, and then proceed to the more nuanced work. I'm willing to try different things, and respond to people's feedback about what works, to make sure that people are getting the information they want and need from Journal Club.

 

But if everyone already knows what MD is on the Moyer study on the table 2, then we're OK, right?  Everyone knows why the urine stat testing was different from the saliva one, right?

Or it might be that nobody cares. 

 

Again, I think you're being a little hard on yourself. That there hasn't been more discussion doesn't mean that no one cares, nor that you haven't communicated with them.

Journal Club is still finding its voice, and I would not interpret too much from the first time. People are still joining the site, and reading, and learning.

We'll try different things and see what people want to talk about.

Perhaps one month is a little long, as well. Perhaps a 2-week period would be a better match--that is something we can try as well.

 

Anyway,  it was a cool experience and I hope someone somewhere got something out of it! 

 

I know for a fact that people have gotten something out of the discussion.

 

checking in

Hi, all--

Just wanted to check in with you about what you'd like to see here.

rchunco has expressed her concerns that people are not interested or don't care, because there hasn't been much discussion lately.

I don't think that's the case at all.

But I think it may be the case that the first time we try something, we find some things that need to be tweaked and adjusted a bit. If there are, then we'll make those changes--there is nothing wrong with making a mid-course correction as needed. If you've seen how sailboats move, then you'll know what I mean by tacking in one direction or another, until we find the right course.

I notice that there was more discussion going on a little while ago than there is now. Is one article per month perhaps too slow a pace? Would two weeks per article be a better speed, and would people have more to say in that case?

Is there anything else we should be taking into consideration?

Journal Club is for you, so if you have any ideas how it can meet your information needs, I hope you feel free to speak up and give your feedback.

You are not wasting your time.

I'm just now checking back after several days absence. I haven't commented or even asked questions because this is all so new & foreign to me, it's quite a bit to absorb. It's focusing on details I would never even begin to know to think about and there is a *lot* there. A lot.

No, you absolutely are not wasting your time. And it's here for as long as this site is here for anyone to read, now and later.

I've probably absorbed, oh, maybe 10%. This will take a little time to sink in. That's okay. Everyone has to start somewhere.

More simple studies

I think Adrienne is right in that this one was probably too much to take in on the first go for us.

But, if hopefully something might be learned, for example:

- it's not enough to just take something at face value.  A little bit of investigation by yourself can throw a new light on a study.

- you don't need to know the guts of statistics, just what they're used for, the question they are aiming to answer and how to interpret them.  Just as you don't need to know how your cellphone works, you just need to know how to use it. 

- identifying what is missing from the statistical analysis should always be considered, but there is no rush in learning about this.  (We can take our time  )

- when a control group is used, comparisions should be made to that group, and effect measures and confident intervals should be measured.

- Peer review and letters after names does not necessarily mean things are perfect. 

- Massage therapy doesn't have the same meaning to everyone, so you have to pick and choose according to what your definition of it is.  (At some stage there *might* be some definition of it for the research world, but personally I doubt that will happen any time soon - if at all, but that's just an opinion).

- The level of transparency in research varies, and there is also the fact that we cannot get our hands on the full abstracts for all studies.  We cannot count on abstracts alone to have any decent level of analysis. 

- Inclusion criteria for "research on research" is very important.

- Qualitative assessment isn't always done on the research on which the "research on research" is done.  We should be able to identify when that is the case.

- Sometimes the question does not match the methodology.  Even before the statistics are looked at, this should be considered.  I hope that a future article in the journal club will highlight this (it's much better to take just one study and break it down with some level of detail - since this study had included so many, it could have been confusing).

okey doke!

Rosemary

one other little thing

This little document might be good for some of us as an intro to critical appraisal.  

-Rosemary

I think this has been fine,

I think this has been fine, though I realize I'm not the journal club's intended audience.  And as was noted a while back, I'm sure there are people reading who are not commenting but who are still taking something away from this.

I agree with the idea that the next selection for your journal club should be something very straightforward in terms of research design.  That will give folks who are just beginning to learn about research more opportunity to jump in, maybe.

-CM

Not a bad place to start

I think this was a great place to start for several reasons. It's relevant to all MTs, it's important in that it challenges one of the more commonly held beliefs in MT, the basic point of the study is not that difficult to understand, and in trying to understand it, it introduces many different things that need to be taken into consideration. And, how often will we have direct access to an author of a study?

And - there's a lot going on here. It serves to illustrate how much most of us have to learn!

So, I think the idea of looking at a study that might be less of a big task would be good. Doing this one made it clear that might be a good next step.

It's going to take me awhile just to digest what is here but trust me, I appreciate it. This whole discussion is a tutorial.

One thing to consider....

If someone reads a study and they don't know what something means in it, they should ask "What does this mean......?" since that is one of the aims of the JC. 

Raven - it might also be an idea to have a "put up an abstract and lets translate it into English" thread somewhere since I think that would be a start, highlighting some of the research lingo and also highlighting the study type. 

I'll be honest with you in saying that sometimes I wonder about how this all will be used.  For example, 100 case studies that show showed improvement after massage for some condition or other is not good evidence that massage works for the condition, and neither would 1000 case studies.  That needs to be stressed.  I've often seen "but there is lots of research that show that blah blah blah works", and it's obvious that people are only looking at abstracts and not taking the type of study into consideration. Threats to validity is a biggie.

I doubt that most people would know where this study lies on the evidence pyramid.  I think I should have mentioned that at some stage.

anyway, live 'n' learn.

-Rosemary

Expand this point?

"For example, 100 case studies that show showed improvement after massage for some condition or other is not good evidence that massage works for the condition, and neither would 1000 case studies."

 

Would you be willing to explain why that is so, Rose?  Illustrating this point could be very valuable to some readers.

-CM

Rather than blab on about it

Here is a link that I think might be useful for some people.

I think part of the issue is that people think that a case study has a smidgen of evidence, and if there are 100 or 1000 times that smidgen then that cumulative value can be high, but that's "not the case"smiley.  It's not cumulative.

BUT I also think that in our practices case studies can be very useful for sharing information on therapeutic decision making - and making them open to critical appraisal - plus there are a lot of other reasons why we need them (covered elsewhere on this site, no doubt). 

The pyramid has been criticized by some (and some other models put forward as more applicable to our work) but the underlying reasoning should be understood by everyone before they consider the other models.  Misuse of the lower types of studies for *marketing our industry* makes me cringe - since it's not only taking advantage of MTs being ignorant of the reasoning behind the pyramid (e.g. "case studies have shown that modality x works - yay- come take a class", when more rigorous studies might show that it is has no therapeutic effect) but also the public (rehashed to "research has shown that modality x works - buy a session off me"), and I know this does and will continue to happen.  Taking advantage of someone's ignorance for our own gain might be the way the world works, but it doesn't mean it's right.

Yeah, so when people say "there's lots of studies that show that x works", they're not taking into consideration the type of studies or the quality of the studies either.  Also, if you look at this thread, everyone at the beginning just took this study and said "oh look, the cortisol claim has been over-turned", without looking any deeper and asking any questions about this study.  It was pure acceptance of this new study.  That's worrisome.  It shouldn't happen.  But that's just me!

Oh well! 

-Rosemary

 

 

 

 

 

 

 

Journal Club participation

First off, I want to thank Rose, Raven, Christopher and all the commentors on this thread.

Rose, your comments have been extremely helpful, well thought out, and show that you have spent a LOT of time on this. A huge and hearty thank you for taking the time to dig in and review this study in such detail. Please do not lose your motivation when people do not respond or comment with as much enthusiasm or intensity as you show when you comment. Your work is appreciated and, in my view, much needed. This site is one of the few (if only) sources of MT study of the research that is out there and it is a training ground for MTs on how to read and analyze research in general as well as the studies that are out there on massage therapy.

That all said, it does take time to read and comment and digest all this so comments will likely trickle in over time. (I just finished reading through this thread today and it's nearly the end of October.) I hope more of us will contribute to POEM, both financially and through commenting and eventually conducting research ourselves. I think Journal Club is such a great way to teach MTs research because we are starting with picking apart existing studies so that any research we conduct or review in the future will more likely avoid some of the errors made by those that went before us. Science and research is always evolving, never completely "done."

As to the article under review in Sept Journal Club, I want to point out that it has not "proven" that massage does not reduce cortisol. What it does say is that "The claim that massage reduces cortisol is brought into question. " In just a cursory review of some of the articles published regarding the effect of massage therapy on cortisol, my reaction is, we need to conduct more research on this claim before we can definitively say that massage therapy does not reduce cortisol. Correct me if I am wrong. (I often risk being wrong--it allows me to learn!)

We need to be very careful about how we present research questions and claims. For example: Does massage therapy have an effect (at all-reducing or increasing) on cortisol levels? When does this effect occur (During after, how long after)? How long does the effect last? When is best to measure the effect on cortisol levels (1 min after massage, 5 minutes after, an hour, 3 days)? Does the type of massage performed have a different effect on cortisol levels (Clothed, unclothed, with lotion, chair massage or on a table, sports, trigger point work, Swedish, myofascial)? How do we operationally define massage therapy in studies on the effect on cortisol? Does the type of person administering the massage have an effect on cortisol levels (trained, untrained, male, female, familiar, unknown, etc.)? Does the duration of the massage session affect cortisol levels differently? Is there a difference between effects on cortisol for those receiving multiple massages over a period of time or one single session of massage? Does the level of pressure used have an effect on client's cortisol levels (A big question and a difficult one to measure objectively)? Some of these questions could be answered with a study all by itself. Or perhaps a few studies could incorporate some of these questions if they were carefully constructed.

Now some may say, "oh my gosh, this is far too overwhelming! It's too much to think about and who on earth has time and energy and the training to conduct such research?" I know I've had this reaction at times. But then I get excited. I think, wow, what an awesome time to be a massage therapist and have the opportunity to learn about resaerch, maybe conduct some of our own and perhaps even help change the profession, increase our credibility, provide other MTs with substantative information about what we do. What an exciting time to be in the field! (I'm hoping more people have the latter reaction.)

I hope that after reviewing some of these studies, there are those of us in this group that will be able to conduct some new studies. While we can be grateful to Tiffiany Fields and others in the early days of massage research, many of the MT studies out there will not live up to rigorous scientific scrutiny.  Let's give our fellow MTs something more substantial to chew on and use as resources to back up claims about the health benefits of MT. I hope over time that as we all grow stronger in our understanding of research methods, we can substantially contribute to the research available.

OK, off my soap box. Again, thank you, thank you, thank you all!

Sue Shekut, L.M.T.

An excellent summary

 

Thank you, Sue. You've summed up a number of important points very well.

 

Rose, your comments have been extremely helpful, well thought out, and show that you have spent a LOT of time on this. A huge and hearty thank you for taking the time to dig in and review this study in such detail. Please do not lose your motivation when people do not respond or comment with as much enthusiasm or intensity as you show when you comment. Your work is appreciated and, in my view, much needed.

 

Seconded, absolutely. I am very pleased with the work Rose has put into this, including as webmistress as well as in the discussion, and I am blown away by what the other commenters have brought in as well.

It's early days, and we're building something that will last for decades, so I'm pleased with the participation to date. It will certainly add up, as well as picking up over time, as we get familiar with doing this. Remember that we're doing something new here, and we all have to figure it out.

In addition to the intellectual exercise going on, I'm also pleased at how the community is taking shape. It's very kind of you to reach out to Rose like that, and I think the supportive community aspect of POEM is a very important part of the site as well.

 

That all said, it does take time to read and comment and digest all this so comments will likely trickle in over time. (I just finished reading through this thread today and it's nearly the end of October.) I hope more of us will contribute to POEM, both financially and through commenting and eventually conducting research ourselves.

 

I won't shut down threads just because "time's up". Think of it more like there's a "featured" article or set of articles each month, but discussion can occur on any article at any time. Comments are welcome even years later--remember, some people haven't even found POEM yet, so they haven't had a chance to comment.

What I will do is mine the comments for content for the e-Books, so I'll have markers that say things like "all content before this point has been adapted for the human systems e-Book", or something like that. But that's not the end of the discussion by any means; it just means that if more discussion happens later, I'll know where to pick up working with the content, rather than redoing what was already done.

 

I think Journal Club is such a great way to teach MTs research because we are starting with picking apart existing studies so that any research we conduct or review in the future will more likely avoid some of the errors made by those that went before us. Science and research is always evolving, never completely "done."

 

A very important point, which you've made very well. This exercise is not just about nitpicking, it's to help make future research efforts even better.

I think that that point might not be widely recognized, so it might look more negative than it is. But it's important to remember why we do it, and you've stated it concisely and clearly. 

 

As to the article under review in Sept Journal Club, I want to point out that it has not "proven" that massage does not reduce cortisol. What it does say is that "The claim that massage reduces cortisol is brought into question. " In just a cursory review of some of the articles published regarding the effect of massage therapy on cortisol, my reaction is, we need to conduct more research on this claim before we can definitively say that massage therapy does not reduce cortisol.

 

It's an interesting situation.

We've started discussing, but have not yet fully explored, the concept of "burden of proof". Part 1 of the discussion is here; there will be more later.

The gist of it is that positive claims ("Treatment X has Y effect") and negative claims ("Treatment X has no effect") are not treated exactly the same way. The first statement has the risk of false positives, when we mistakenly think that something has an effect that it does not have.

The second statement has the risk of false negatives, where we mistakenly miss a real effect that something does have.

Since there's no way we can always get it exactly right, and we have to choose whether we'd rather live with false positives or false negatives, textbook science--the kind that students study, as opposed to frontier science, where scientists are still trying to figure out what is actually happening--tends to favor false negatives over false positives.

It can be frustrating in its conservatism, but there are advantages to favoring false negatives over false positives, and the idea is that if something really does have an effect that we've missed, the truth will eventually come out. If we favor false positives, on the other hand, we soon get lost in all the possibilities, and no longer can sort out what's really going on from what's a false positive effect.

So, as a positive claim, "massage does reduce cortisol" had a higher burden of proof than the corresponding negative claim, which would be "massage does not reduce cortisol".

It looked like that result was so consistent over so many studies that the claim had met its burden of proof, so people started to cite it as though we were sure it was true.

What Moyer et alia ("and all", meaning his team) pointed out was that the Field team's calculation was done in a non-standard way that increased the risk of false positive. Since so much of the research has been done by the Field team, then so much of what was assumed to meet the claim's burden of proof is now shown to be weaker than thought.

Since the positive claim needed that evidence, because it had the higher burden of proof, then this does strengthen the negative claim, that massage does not reduce cortisol.

It doesn't mean that massage doesn't work, only that the cortisol explanation does not account for massage effects.

But it's interesting, because so much of the research has been carried out by one team, that if that team has made a mistake, the uncertainty about it affects so much. I, too, would like to see more research carried out by more teams, just to reduce this kind of risk.

 

Correct me if I am wrong. (I often risk being wrong--it allows me to learn!)

 

That's the spirit!

This is a safe place to learn, and that means taking the risk of being wrong. But while factual corrections are permitted, they will never be allowed to become personal attacks or insults, because we want to preserve this as a safe place for everyone's learning.

 

Now some may say, "oh my gosh, this is far too overwhelming! It's too much to think about and who on earth has time and energy and the training to conduct such research?" I know I've had this reaction at times. But then I get excited. I think, wow, what an awesome time to be a massage therapist and have the opportunity to learn about resaerch, maybe conduct some of our own and perhaps even help change the profession, increase our credibility, provide other MTs with substantative information about what we do. What an exciting time to be in the field! (I'm hoping more people have the latter reaction.)

 

It can be overwhelming--but remember, none us of is alone here! All of the things you said are true, and we're here for each other along the journey. So if one of us has trouble with some concept, there are others who can and will help. And then someday the learner will be the one to share with a future learner. So the initial overwhelmingness of it begins to disappear when we work together.

 

I hope that after reviewing some of these studies, there are those of us in this group that will be able to conduct some new studies. While we can be grateful to Tiffiany Fields and others in the early days of massage research, many of the MT studies out there will not live up to rigorous scientific scrutiny.  Let's give our fellow MTs something more substantial to chew on and use as resources to back up claims about the health benefits of MT. I hope over time that as we all grow stronger in our understanding of research methods, we can substantially contribute to the research available.

 

We're very happy you're here, and look forward to your participation in the discussion.