Evaluation of English and Spanish Health Information on the Internet

Executive Summary

Introduction

The Internet is the largest searchable medical library in the world. It is also becoming an increasingly important and influential source of health information for the public. A recent Harris Interactive survey estimated that 97 million adults used the Internet to look for health-related materials. Studies also indicate that consumers are paying attention to what they find; more than 70% say online health information has influenced a treatment decision.

Consumers looking for health information on the Internet face a vast array of choices--tens of thousands of health-related Web sites and millions of Web pages of health-related material. How will this wealth of information influence the way that patients interact with physicians and the way that health care delivery is structured?

Optimists predict that ready access to the latest scientific information will allow consumers to participate more actively in their own care, perhaps even eliminating some disparities in access by allowing people to bypass the medical care system. Others worry that the Internet contains incomplete and misleading information, which may directly harm patients by misdirecting a treatment decision. Several studies have documented substantial variability in health-related Web site content for specific diseases. However, there have been no systematic assessments of whether the available web-based information is sufficiently complete and accurate to support consumer decision-making.

Recently several groups have become concerned with ensuring that the health information a consumer finds on the Internet is of high quality. A number of organizations have proposed, published and voluntarily implemented criteria to guide the evaluation of health-related Web site content. However, these criteria have not been systematically applied to a broad set of Web pages and to multiple medical conditions.

In response to concerns about variation in quality of health information on the Internet, the California HealthCare Foundation asked RAND to design and conduct a large study to that described and evaluated English and Spanish health information on the Internet. This study represents a major step forward in rigorous evaluation of web-based health information.

The study addressed three questions:

The study focused on four medical conditions (breast cancer, childhood asthma, depression and obesity) in two languages (English and Spanish). These conditions affect diverse populations and are associated with an increased likelihood of early death and disability. Thus they represent the kinds of health care concerns that would motivate consumers to look for information on the Web. A shorter version of this report is also published in the May 23, 2001 issue of the Journal of the American Medical Association (JAMA, May 23/30, 2001-Vol.285, No. 20).

Study Design

Evaluating Search Engine Performance. In our assessment of search engine performance, we focused on two basic issues:

  1. What are consumers likely to find when they search for specific health topics online? Does the choice of a search engine affect the results of a simple search for health information?
  2. How easy is it to find relevant information?

We studied ten English-language and 4 Spanish-language search engines, selected because of their popularity or their unique method of ranking sites. Using each search engine, we conducted a series of standardized searches using one simple search term for each medical condition. We categorized the results of these searches, including the characteristics of the top Web sites and the type of content found.

Assessing the Quality of Health Information on the Internet. In the second part of the study, we addressed three questions:

  1. How comprehensive is the information on selected e-health Web sites?
  2. How accurate is it?
  3. How often do Web sites provide documentation that allows one to assess the source or currency of the material?

We selected 18 English-language health Web sites (6 general health, 12 condition-specific) and 7 Spanish-language Web sites (3 general health, 4 condition-specific). Six English-language general health Web sites were chosen based on popularity (they were ranked highly in 2 widely used Internet industry reports, Cyber Dialogue and PC Data Online). Content provided by one of the most popular search engines was also included. We selected condition-specific English-language Web sites and all Spanish-language Web sites to represent prominent examples of condition-specific sites from commercial, government and nonprofit educational organizations.

For each of the four medical conditions, we convened panels of 3-4 nationally recognized clinical experts and representatives from patient advocacy organizations. Each panel developed 5-7 essential, "need-to-know" topics and "consumer-oriented" questions, designed to reflect the concerns of patients or family members seeking health information. The questions were very basic, as illustrated by the examples below.

Examples of questions related to treatments for breast cancer:

Examples of questions related to the etiology and symptoms of childhood asthma:

Examples of questions related to the causes and treatments for depression:

Examples of questions related to treatments for obesity:

Based on reviews of the clinical literature, the panels then developed a series of standardized concepts (clinical elements that should be addressed) for the topics and questions. In effect, these concepts represented the kind of clinical information that the panelists thought consumers should find on a site. For example, the clinical elements for depression included the following:

After the topics and questions were developed, two trained searchers visited each selected Web site looking for information related to the consumer-oriented questions. The results from each search were saved and assembled into separate notebooks. All site identifiers were removed, and the notebooks were sent to physician experts to rate the quality of the material. Four rating forms were developed to help standardize the evaluation.

Readability Assessment of Health Information on the Internet. In the third section of the study, we determined the reading grade level at which information provided by these Web sites is written.

We applied widely accepted readability formulas to randomly selected passages of text from both English- and Spanish-language Web sites. These formulas measure grade levels as a function of the average sentence length and word complexity in text samples.

Key Findings

Search engines are not efficient tools for locating health information on a particular health topic.

Consumers often find incomplete answers to important questions; however, the information that is provided is generally accurate.

Most Web-based health information will be difficult for the average consumer to understand.

According to recent health literacy studies, the majority of many health consumer populations cannot understand material written at a 9th grade reading level. This means that most health Web sites require reading skills beyond the abilities of many consumers, especially those underserved populations who are most in need of this information.

Conclusions

Search Engine Performance. More than half of the consumers who use the Internet report that they spend about half an hour looking for health information, so efficiency is an important dimension of search engine performance. Search engines do provide users with an enormous amount of information, but most of it is not relevant to the search. Overall, just one in five links identified by 10 English-language search engines and one in eight links from 4 Spanish-language search engines led to a Web page containing content relevant to the search.

Quality of Health Information on the Internet. Although we retrieved more than 20,000 pages of material related to the "need-to-know" topics and "consumer-oriented" questions, we found substantial gaps in the availability of key information. For example, less than half of the Spanish-language materials explained that mastectomy and lumpectomy plus radiation are equivalent treatments for early-stage breast cancer. Few sites indicated that a woman with a persistent breast mass and a negative mammogram usually needs further evaluation. If consumers are relying on the Internet to help make important treatment decisions, such deficiencies in information could have serious consequences.

Readability Health Information on the Internet. Much of the health information available on the Internet is beyond the comprehension of many consumers. All of the English Web site documents assessed had materials that required at least a 9th grade reading level, and more than half presented material at the college level. Four of seven Spanish-language sites presented materials at the 9th grade reading level or higher.

Studies of patients in various clinical settings suggest that a 9th grade reading level is too high for most patients. The reading ability of patients and the readability of health-related information on the Internet must be more clearly matched before the Internet can become an effective medium for patient education or have any potential to reduce disparities in access.

Recommendations

These study findings suggest a variety of steps that consumers, providers, and policymakers could take to help the Internet achieve its promise of making critical health information available to all. Based on our analysis, we make the following preliminary recommendations.

To consumers

  1. The Internet health space is unimaginably vast, and navigating through that space is more challenging than you might think. Therefore, it is important to set aside adequate time for a search, and plan to visit several sites. Our searchers found that even finding material for a single consumer question took 10-15 minutes per Web site with a high-speed Internet connection.
  2. Be aware that sites will not necessarily provide a comprehensive picture of what you need to know about a condition. Web-based information can only supplement consultation with a health care professional. But sites may give you information that helps you ask good questions and understand more clearly what your doctor tells you.
  3. Many sites are created and maintained for commercial reasons. This does not mean that the information you find there is inaccurate, but it is wise to consider the possibility that someone is trying to sell you something by providing this information.
  4. Do not be surprised if you find information that is conflicting or difficult to understand. Ask a health care provider to help you interpret what you find.

To consumer advocacy groups

  1. Advocacy groups could help consumers by "adopting" one or two relevant sites and continually screening their content for coverage and accuracy. One might imagine commercial arrangements between e-Health companies (who would produce the content) and advocacy organizations (who would arrange for impartial review by experts).
  2. Advocacy groups could also help consumers by pressing for improvements in site content and presentation to make information more complete, accurate, and accessible, and by referring consumers to good sites.

To health care providers

  1. Be aware that the Internet exposes patients to a vast and often confusing array of information, sometimes helping, sometime hindering the process of providing good care.
  2. Professional societies could play a key role in organizing physicians, pharmacists or other professionals to provide a more formal interpretive function. In the current health business environment, this may require attention to the mechanisms by which such services could be reimbursed.
  3. Specialty societies could provide key clinical content for Web sites. Their involvement could significantly improve coverage, accuracy, and presentation of the material. Specialty societies should work with advocacy organizations to ensure that the material they develop addresses important patient concerns and facilitates patient decision making about when to seek medical care.

To providers of Web-based health information

  1. Web site content providers could commission clinical panels of experts to review coverage, accuracy, and factual conflicts before material is put before the public. They could also work with consumer advocacy organizations to make sure that frequently asked consumer questions are addressed and that language used is appropriate for the intended audiences.
  2. Information on consumer-oriented health-related sites should be provided at lower reading grade levels. Currently, the US Department of Health and Human Services recommends that patient education materials not exceed a 6th grade reading level. A serious effort should be made reach this standard.
  3. Clinical content should be periodically reviewed by experts, and such reviews should be required in all quality assessments of health-related sites. Reviews should be conducted under the auspices of an independent party that is unrelated to the Internet health information provider and does not provide such information itself.
  4. Readability standards should also be part of quality assessments of health-related Internet sites. Because readability is an integral part of providing information, explicit standards should be articulated and the methods used for assessing readability should be made public. Readability assessments of materials on Spanish- and English-language Web sites should be made on a regularly basis, and results should be widely disseminated in both languages to help consumers choose sites that will best meet their needs.

To policymakers and regulators

  1. The best English-language Web sites are far better than the best Spanish-language sites. Major gains in quality of Spanish-language information (and very likely other non-English language information) can be achieved by translating and culturally adapting what is now available in English.
  2. Continue to fund high-quality sites, especially for consumer audiences who are not well served by free market forces alone.
  3. Put greater effort into publicizing and increasing access to high-quality government sites.
  4. Fund research on how to effectively communicate health information in Web-based format to readers with a wide range of reading levels.

The Internet offers tremendous potential for consumers to easily access important information about their won health problems and those faced by friends and families. This study takes an important first step in conducting rigorous evaluations of Internet medical content. We hope our findings will be used to guide future improvements in the availability and quality of health information on the Internet.


Table of Contents
Acknowledgments