Subject Guides: Systematic Reviews: Developing Searches

Steps of the Search Process

As you begin your project, it is important to define a clear scope of your review. Craft your research question. Brainstorm the inclusion and exclusion criteria that you will use to screen sources. Start to identify keywords related to your question and criteria.

Chapter 2 in the Cochrane Handbook offers useful tips for determining the scope of a review: https://training.cochrane.org/handbook/current/chapter-02.

The Covidence Academy article, "How to formulate the research question using PICO. 5 steps to get you started" explains the PICO framework as one method for formulating an effective research question.

Before fleshing out your keywords, decide what databases you will search. You want to cast a wide net across several databases to ensure that you pull back all (or almost all) relevant articles.

One great strategy is to pull up recent systematic reviews on topics similar to yours and ones in journals you might choose to publish in. Tally which databases are most commonly searched and use this to inform your own selections.
- While you are at it, look for sample search queries in these systematic reviews.
- Take note of what limiters, if any, are used. Determine what record fields (e.g., title, abstract, index/subject terms) are typically searched. Use this information to make decisions about your own search queries later in the process.
The following resources offer comparisons of databases and may help you determine which databases will best fit your systematic review.
- This article reviews the capabilities of various search engines and databases.
- This Michigan State LibGuide describes how Google Scholar differs from human-curated databases like MEDLINE.
You can also look through subject guides to see which databases are recommended for a given discipline by subject librarians.

An A to Z list of all BYU database subscriptions can be found at https://lib.byu.edu/databases/.

The following table lists several databases commonly used in social and health sciences systematic reviews. The names on the left link to the database through BYU. The links on the right take you to various how-to guides. Databases share many features, but they also have unique elements such as their own search algorithms, search functions, and controlled vocabulary. Understanding these unique elements up front and how they work will save you time later.

Database Name	Database Links
MEDLINE (PubMed)	Building a Search in PubMed PubMed: Using the Advanced Search Builder Medical Subject Headings (MeSH) Using MeSH Note: You can also search MEDLINE on the EBSCO Platform.
Embase	Overview of Embase Search Options Embase Support Center Quick Reference Guide Embase 201:Systematic Reviews Using EmTree Comparing EmTree to MeSH
CINAHL	Advanced Search Tutorial Using EBSCO CINAHL User Guide CINAHL Help Sheet Info About CINAHL Subject Headings Using CINAHL Subject Headings
Scopus	Scopus Tutorials Note: Both Scopus and Embase are owned by Elsevier. For more information about similarities/differences, see "If I have access to Scopus, do I need Embase?"
Web of Science	Web of Science Help Web of Science Search Tips Web of Science Advanced Search
APA PsycInfo	Quick Reference Guide Advanced Search on APA PsycInfo (EBSCO) Info about APA PsycInfo's Thesaurus

A couple of things to note

At BYU, CINAHL, APA PsycINFO, and MEDLINE can be searched on the EBSCO platform, which means that you could search all three databases simultaneously. On the plus side, this saves time. On the downside, deduplicating results is trickier and you may have to adjust use of controlled vocabulary because they may differ between databases.
Most of these databases focus on peer-reviewed literature. If you are concerned about evidence selection bias (e.g., not pulling all relevant data because significant findings are more likely to be published than non-significant findings), you may also need to search gray literature and unpublished studies. See Paez (2017) for a viewpoint on why grey literature is important and how to use it in systematic reviews. See this King's College LibGuide about finding grey literature.

Reference

Paez, A. (2017) Gray literature: An important resource in systematic reviews. J Evid Based Med. 2017; 10: 233– 240. https://doi.orr/10.1111/jebm.12266

Developing your sets of keywords will be an iterative process. One way to keep track of your keywords is to create a simple table like the following:

Search Concepts	Sets of Keywords
Concept 1 (e.g., traumatic brain injury)	Set of keywords that represent Concept 1 (e.g., "traumatic brain injury" or TBI or "head injury" or etc.)
Concept 2	Set of keywords that represent Concept 2
Concept 3	Set of keywords that represent Concept 3
etc.	etc.

Each time you identify an additional synonym for a concept, add it to the appropriate set of keywords. Notice that each set is being built with "or" separating each word. This makes it easier to translate it into the database query later. In this example, multiple-word phrases are encased with double quotation marks. This is a common way to make sure databases search for full phrases rather than individual words. You might also decide to truncate words that would have multiple endings. This is often signified with an asterisk (e.g., injur* would retrieve injury, injuries, etc.). Usually you can use both truncation and quotations in the same phrase. However, because databases have their own search functions and algorithms, it is best to examine what works for each one.

Additional tips for identifying keywords:

Lead researchers on the project generally know the literature. Have them list keywords for each concept.
Run a search for each concept individually in one or more databases. Look for articles relevant to your concept. Then, pull new keyword variations from titles, abstracts, author keywords, and subject lists.
Find the list of controlled vocabulary for a database and search for each of your concepts. See what related terms are listed and pull those that are relevant.
If relevant, use the acronyms PICO (Population, Intervention, Comparison, Outcome) or SPIDER (Sample, Phenomenon of Interest, Design, Evaluation, Research type) to develop sets of keywords. See Cook et al. (2014) for a comparison of using these approaches for a qualitative evidence synthesis.

The scope of your review and the popularity of your topic will impact how many initial results you get from all of your database searches. Sometimes you get thousands. Sometimes you get hundreds. When possible, using database limiters can be an easy way of applying exclusion criteria from the beginning. Some potential limiters are the following:

Publication date range (e.g., if you are updating a previous review)
Language (e.g., if your research team will only be able to read articles in certain languages)
Source type (e.g., empirical, peer-reviewed)
Species
Sex

As you think about applying limiters, consider whether they will introduce bias into your process. For example, if you limit to only peer-reviewed articles, you may exclude relevant theses and dissertations. Also keep in mind that not all databases have the same limiters.

Also, depending on how you are tracking your process, you should note the number of results in each database before and after you apply limiters.

Remember to document exactly how you do your searches so you can convey the process clearly in your methods. The goal is replicability. Ideally, you want to run searches as similarly as possible across databases. However, your search may be adapted somewhat to accommodate the search functions and controlled vocabulary of each database.

The following table illustrates how to translate a search across several databases.

Database	Search Translation	Number of Results	Permalink to Search (if available)
	Basic Query: ("spatial working memory" or "visuo-spatial memory") AND ("traumatic brain injury" or "head injury" or "brain injury" or "brain damage")
MEDLINE (PubMed)	Query 1: When you enter the basic query above into MEDLINE, it will by default search all fields. Translated to MEDLINE's syntax, it would like: ("spatial working memory"[All Fields] OR "visuo-spatial memory"[All Fields]) AND (((("traumatic brain injury"[All Fields] OR "head injury"[All Fields]) OR "brain injury"[All Fields]) OR "brain damage"[All Fields]) Query 2: If you enter the basic query without quotations, MEDLINE will automatically map keywords onto its controlled vocabulary MeSH terms. Translated to MEDLINE's syntax, it looks like: (("visuo-spatial"[All Fields] AND (((("memory, short-term"[MeSH Terms] OR ("memory"[All Fields] AND "short term"[All Fields])) OR "short-term memory"[All Fields]) OR ("working"[All Fields] AND "memory"[All Fields])) OR "working memory"[All Fields])) OR ((((("spatial"[All Fields] OR "spatialization"[All Fields]) OR "spatializations"[All Fields]) OR "spatialized"[All Fields]) OR "spatially"[All Fields]) AND (((("memory, short-term"[MeSH Terms] OR ("memory"[All Fields] AND "short term"[All Fields])) OR "short-term memory"[All Fields]) OR ("working"[All Fields] AND "memory"[All Fields])) OR "working memory"[All Fields]))) AND ((((((("brain injuries, traumatic"[MeSH Terms] OR (("brain"[All Fields] AND "injuries"[All Fields]) AND "traumatic"[All Fields])) OR "traumatic brain injuries"[All Fields]) OR (("traumatic"[All Fields] AND "brain"[All Fields]) AND "injury"[All Fields])) OR "traumatic brain injury"[All Fields]) OR (((("brain injuries"[MeSH Terms] OR ("brain"[All Fields] AND "injuries"[All Fields])) OR "brain injuries"[All Fields]) OR ("brain"[All Fields] AND "injury"[All Fields])) OR "brain injury"[All Fields])) OR (((("craniocerebral trauma"[MeSH Terms] OR ("craniocerebral"[All Fields] AND "trauma"[All Fields])) OR "craniocerebral trauma"[All Fields]) OR ("head"[All Fields] AND "injury"[All Fields])) OR "head injury"[All Fields])) OR (((("brain injuries"[MeSH Terms] OR ("brain"[All Fields] AND "injuries"[All Fields])) OR "brain injuries"[All Fields]) OR ("brain"[All Fields] AND "damage"[All Fields])) OR "brain damage"[All Fields])) Note: You can set up the query to search specific fields (e.g., abstract instead of all fields) with each group of keywords.	Query 1 Results: 75 Query 2 Results: 582
Embase	Similar to MEDLINE, Embase can map your query to its controlled vocabulary: EmTree terms. The following reflects the baseline query entered into Emase's advanced search with the following mapping options selected: map to preferred term in Emtree, search also as free text in all fields,explode using narrower Emtree terms, and search as broadly as possible. ('spatial working memory'/exp OR 'spatial working memory' OR 'visuo-spatial memory') AND ('traumatic brain injury'/exp OR 'traumatic brain injury' OR 'head injury'/exp OR 'head injury' OR 'brain injury'/exp OR 'brain injury' OR 'brain damage'/exp OR 'brain damage') Note: You can set up the query to search specific fields (e.g., abstract) with each group of keywords.	263
CINAHL (EBSCO)	When using the advanced search in CINAHL, by default it uses the query to search titles, abstracts, and subject headings. You can set up the query to search other fields (e.g., abstract).	21	Permalink
Scopus	When using the document search in Scopus, by default it uses the query to search titles, abstracts, and keywords. The resulting query is the following: ( TITLE-ABS-KEY ( ( "spatial working memory" OR "visuo-spatial Working memory" ) ) AND TITLE-ABS-KEY ( ( "traumatic brain injury" OR "head injury" OR "brain injury" OR "brain damage" ) ) ) Note: You can set up the query to search other fields (e.g., abstract).	225
Web of Science	When using the basic search in Web of Science, by default it uses the query to search titles, abstracts, author keywords, and Keywords Plus. The resulting query translation is the following: TOPIC: ("spatial working memory" OR "visuo-spatial Working memory") AND TOPIC: ("traumatic brain injury" OR "head injury" OR "brain injury" OR "brain damage") Timespan: All years. Indexes: SCI-EXPANDED, SSCI, A&HCI, ESCI. You can set up the query to search specific fields (e.g., abstract) with each group of keywords.	143
APA PsycInfo (EBSCO)	When using the advanced search in APA PsycInfo, by default it uses the query to search abstracts, authors, keywords, classification codes, subjects, titles, and translated titles. You can set up the query to search other fields (e.g., abstract).	285	Permalink

One Thing to Note

Most databases allow you to combine searches. When using this method, you would run a separate search for each set of keywords (e.g., search #1 for brain injury, search #2 for working memory) and then use an advanced search function to combine the searches (e.g., #1 AND #2). Some may use this to exclude a body of research. For example, you could run a search for the topic sport, which would become search #3, and then do a combined search with the first 2 (e.g., #1 AND #2 NOT #3).

Time-Saving Tip

Before downloading or exporting the results from all database queries, it is useful to have a couple of team members assess whether they seem appropriate.

Look through the first page or so of results. Are potentially relevant and expected articles showing up?
Look at the sum total of results from all queries. Does the sum seem reasonable based on your knowledge of the literature or does the search need to narrow or broaden?

After running your finalized query in each database, you will need to download the records, combine them, and deduplicate them. You can use bibliography mangers, programs like Covidence, or a spreadsheet. The following table describes how to download records to an Excel spreadsheet.

Database	Instructions
MEDLINE (PUBMED)	Downloading MEDLINE records and importing them to Excel is not as straightforward as with other databases. The following instructions have you use RefWorks as an intermediary tool. After running the search, select the save button Under selection, choose all results Under format, choose PUBMED Save file Open RefWorks Select add Select import references Drag file from above into import box Click on share in Refworks Select all references from last import Select tab delimited as the format Click export Save the file Resave the file with the file ending .txt instead of .tsv Open an Excel spreadsheet Go to the data tab Select get external data from text Select the text file resaved from RefWorks In the sequence of steps, indicate the file is delimited by tabs Finish the import and save the file
Embase	Select all records in search using drop down menu under the “Results” checkbox Select export Under format choose Excel with fields as columns Under output, choose full record Click export, wait for it to load, and then click download
CINAHL	Set records per page to highest option Under share button, add page results to folder Repeat for every page of results Go to folder Select export to CSV Open CSV in Excel
Scopus	Under All, click “select all” Select export Make sure you click the box “Abstract & Keywords” Under method of export, select CSV Open CSV in Excel
Web of Science	Select export Select all records If the total records is more than 500, you will have to export 1-500, 501-1000, etc. Select full record Export to Excel
APA PsycInfo	Set records per page to highest option Under share button, add page results to folder Repeat for every page of results Go to folder Select export to CSV Open CSV in Excel

Tracking record numbers is important when it comes time to write the methods and results of your systematic review. Many publications include a figure like the PRISMA flow chart that tracks the number of articles originally found, the number after deduplicating, the number that met inclusion criteria, etc. The PRISMA checklist is also helpful when drafting content for your methods/results. The Record Keeping page in this guide addresses additional areas where you will need to track details of the research process.

Systematic Reviews: Developing Searches

Why Search Queries Matter

Steps of the Search Process