The Internet is the largest searchable medical library in the world. It is also becoming an increasingly important and influential source of health information for the public. A recent Harris Interactive survey estimated that 97 million adults used the Internet to look for health-related materials. Studies also indicate that consumers are paying attention to what they find; more than 70% say online health information has influenced a treatment decision.
Consumers looking for health information on the Internet face a vast array of choices--tens of thousands of health-related Web sites and millions of Web pages of health-related material. How will this wealth of information influence the way that patients interact with physicians and the way that health care delivery is structured?
Optimists predict that ready access to the latest scientific information will allow consumers to participate more actively in their own care, perhaps even eliminating some disparities in access by allowing people to bypass the medical care system. Others worry that the Internet contains incomplete and misleading information, which may directly harm patients by misdirecting a treatment decision. Several studies have documented substantial variability in health-related Web site content for specific diseases. However, there have been no systematic assessments of whether the available web-based information is sufficiently complete and accurate to support consumer decision-making.
Recently several groups have become concerned with ensuring that the health information a consumer finds on the Internet is of high quality. A number of organizations have proposed, published and voluntarily implemented criteria to guide the evaluation of health-related Web site content. However, these criteria have not been systematically applied to a broad set of Web pages and to multiple medical conditions.
In response to concerns about variation in quality of health information on the Internet, the California HealthCare Foundation asked RAND to design and conduct a large study to that described and evaluated English and Spanish health information on the Internet. This study represents a major step forward in rigorous evaluation of web-based health information.
The study addressed three questions:
The study focused on four medical conditions (breast cancer, childhood asthma, depression and obesity) in two languages (English and Spanish). These conditions affect diverse populations and are associated with an increased likelihood of early death and disability. Thus they represent the kinds of health care concerns that would motivate consumers to look for information on the Web. A shorter version of this report is also published in the May 23, 2001 issue of the Journal of the American Medical Association (JAMA, May 23/30, 2001-Vol.285, No. 20).
Study Design
Evaluating Search Engine Performance. In our assessment of search engine performance, we focused on two basic issues:
We studied ten English-language and 4 Spanish-language search engines, selected because of their popularity or their unique method of ranking sites. Using each search engine, we conducted a series of standardized searches using one simple search term for each medical condition. We categorized the results of these searches, including the characteristics of the top Web sites and the type of content found.
Assessing the Quality of Health Information on the Internet. In the second part of the study, we addressed three questions:
We selected 18 English-language health Web sites (6 general health, 12 condition-specific) and 7 Spanish-language Web sites (3 general health, 4 condition-specific). Six English-language general health Web sites were chosen based on popularity (they were ranked highly in 2 widely used Internet industry reports, Cyber Dialogue and PC Data Online). Content provided by one of the most popular search engines was also included. We selected condition-specific English-language Web sites and all Spanish-language Web sites to represent prominent examples of condition-specific sites from commercial, government and nonprofit educational organizations.
For each of the four medical conditions, we convened panels of 3-4 nationally recognized clinical experts and representatives from patient advocacy organizations. Each panel developed 5-7 essential, "need-to-know" topics and "consumer-oriented" questions, designed to reflect the concerns of patients or family members seeking health information. The questions were very basic, as illustrated by the examples below.
Examples of questions related to treatments for breast cancer:
Examples of questions related to the etiology and symptoms of childhood asthma:
Examples of questions related to treatments for obesity:
Based on reviews of the clinical literature, the panels then developed a series of standardized concepts (clinical elements that should be addressed) for the topics and questions. In effect, these concepts represented the kind of clinical information that the panelists thought consumers should find on a site. For example, the clinical elements for depression included the following:
After the topics and questions were developed, two trained searchers visited each selected Web site looking for information related to the consumer-oriented questions. The results from each search were saved and assembled into separate notebooks. All site identifiers were removed, and the notebooks were sent to physician experts to rate the quality of the material. Four rating forms were developed to help standardize the evaluation.
Readability Assessment of Health Information on the Internet. In the third section of the study, we determined the reading grade level at which information provided by these Web sites is written.
We applied widely accepted readability formulas to randomly selected passages of text from both English- and Spanish-language Web sites. These formulas measure grade levels as a function of the average sentence length and word complexity in text samples.
Key Findings
According to recent health literacy studies, the majority of many health consumer populations cannot understand material written at a 9th grade reading level. This means that most health Web sites require reading skills beyond the abilities of many consumers, especially those underserved populations who are most in need of this information.
Conclusions
Search Engine Performance. More than half of the consumers who use the Internet report that they spend about half an hour looking for health information, so efficiency is an important dimension of search engine performance. Search engines do provide users with an enormous amount of information, but most of it is not relevant to the search. Overall, just one in five links identified by 10 English-language search engines and one in eight links from 4 Spanish-language search engines led to a Web page containing content relevant to the search.
Quality of Health Information on the Internet. Although we retrieved more than 20,000 pages of material related to the "need-to-know" topics and "consumer-oriented" questions, we found substantial gaps in the availability of key information. For example, less than half of the Spanish-language materials explained that mastectomy and lumpectomy plus radiation are equivalent treatments for early-stage breast cancer. Few sites indicated that a woman with a persistent breast mass and a negative mammogram usually needs further evaluation. If consumers are relying on the Internet to help make important treatment decisions, such deficiencies in information could have serious consequences.
Readability Health Information on the Internet. Much of the health information available on the Internet is beyond the comprehension of many consumers. All of the English Web site documents assessed had materials that required at least a 9th grade reading level, and more than half presented material at the college level. Four of seven Spanish-language sites presented materials at the 9th grade reading level or higher.
Studies of patients in various clinical settings suggest that a 9th grade reading level is too high for most patients. The reading ability of patients and the readability of health-related information on the Internet must be more clearly matched before the Internet can become an effective medium for patient education or have any potential to reduce disparities in access.
Recommendations
These study findings suggest a variety of steps that consumers, providers, and policymakers could take to help the Internet achieve its promise of making critical health information available to all. Based on our analysis, we make the following preliminary recommendations.
To consumers
To consumer advocacy groups
To health care providers
To providers of Web-based health information
To policymakers and regulators
The Internet offers tremendous potential for consumers to easily access important information about their won health problems and those faced by friends and families. This study takes an important first step in conducting rigorous evaluations of Internet medical content. We hope our findings will be used to guide future improvements in the availability and quality of health information on the Internet.
Examples of questions related to the causes and treatments for depression:
Search engines are not efficient tools for locating health information on a particular health topic.
Consumers often find incomplete answers to important questions; however, the information that is provided is generally accurate.
Most Web-based health information will be difficult for the average consumer to understand.
Table of Contents
Acknowledgments