2 Technical terms

2.1 abstract

An abstract is the short summary at the beginning of scientific publications, e.g. journal articles, theses or conference papers. The abstract is among the most important parts of a publication as it is usually the only part of the text which is freely available and can be retrieved from bibliographic databases. Title and abstract are the key elements of a textword search. (Cals and Kotz (2013), Pitkin and Branagan (1998))

2.2 ambiguity

A term or statement which has more than one possible meaning or definition is ambiguous and can exist in very different contexts.

Ambiguity is often caused by polysemy, i.e. the fact that one expression can have more than one meaning, or by homonymy, in which independent words possess identical spelling, but bear different meanings.

Ambiguity proves to be an obstacle in creating search strategies, because any free text search with ambiguous terms will inevitably retrieve records for all meanings of the term regardless of the context. In this case, it is an option to render such terms more specific by using phrases or proximity operators.

Examples

The acronym CVD is a very common abbreviation for cardiovascular disease. However, it can also mean cerebrovascular disease, chronic venous disease, color vision deficiency or chemical vapour deposition.
The term pharmacy may refer to the pharmaceutical sciences, the manufacture of drugs or the pharmacy as a retail shop.
In anatomy, a styloid process is a pointed outgrowth from a bone. However, there are several of these in the human body, for instance in the temporal bone, the radius bone, the ulna bone or the metacarpal bones.
The search term crab might lead to articles about one of two very different kinds of arthropods: True crabs (Brachyura) or crab lice (Phthirus).
discharge could mean the release of an inpatient from a hospital or, in another context, the release of an electric charge.
lead can be the verb to lead, which can mean to direct something or someone with authority, or the noun lead, which can mean various things, such as a heavy metal element (Pb) or the thin graphite cylinder within pencils (“pencil lead”).
dialysis is a chemical technique of separating molecules in solution. It is also the colloquial term for renal dialysis, a medical treatment using the aforementioned technique to filter blood and thus temporarily replace the kidney function.

2.3 appendix

An appendix provides additional content to a publication and is often called supplementary material or supplementary information. Usually any content which goes beyond the constraints of the publication is provided there. Examples are the full search strategies of systematic literature searches, detailed descriptions of methods or measurements.

It is also possible to publish such supplementary information or research data independently in online repositories.

2.4 author keyword

Publishers often require the authors of a publication to provide keywords describing the content of the publication. These keywords don’t necessarily correspond to controlled vocabulary (i.e. index terms, which are assigned to the records by the database) and should not be confused with them. (Névéol et al. (2010))

Author keywords are often used as part of the free text search. (DE: Stichwortsuche)

Examples

Within PubMed, the data field [Other Term] can be searched for author keywords. This field is automatically searched when [Title/Abstract] or [Text Word] is used (see PubMed Search Field Tags).

In Ovid MEDLINE the corresponding field codes .kw or .kf can be used for searching the author keywords.

2.5 bias

Bias is a systematic deviation between results and facts that may lead to under- or over-estimation of intervention effects. As a result bias might lead to conclusions which do not accurately represent the truth. There are various types and sources for bias, as well as methods to avoid some forms of bias, such as randomization or blinding of participants in clinical trials.

Systematic literature searching is a means to reduce bias. Systematic reviews strive to minimise these deviations by assessing the risk of bias in results of included studies. (Braun et al. (2021), Boutron et al. (2022), Gough et al. (2017))

2.5.1 selection bias

Systematic differences between comparison groups lead to a deviation from the true effect of an intervention. This so-called selection bias can be prevented by measures such as adequate randomization and concealment of the allocation of trial participants to different study arms. (Gough et al. (2017), Odgaard‐Jensen et al. (2011))

2.5.2 publication bias

Results perceived as “positive” are more likely to be published than those results which are perceived as “negative”, which gives the positive results more weight. (Gough et al. (2017))

2.5.3 language bias

In systematic literature searches it is often tempting to restrict the search to languages which are easily understood by the screeners and researchers. Not only will this practice keep the number of records at a more manageable level, it also makes the translation of foreign publications unnecessary. However, this so-called language of publication bias can have a significant impact on the quality of the review. (Gough et al. (2017), Morrison et al. (2012), Moher et al. (2003))

2.6 BibTeX

BibTeX is a bibliographic file format which is supported by most reference management programs, in particular JabRef, which supports it natively. It is often available as an export format in bibliographic databases and on publisher websites. BibTeX files have the file extension .bib.

Example: BibTeX format

@article{watson2020,
  author   = {Watson, Mandy},
  journal  = {Br J Nurs},
  title    = {How to undertake a literature search: a step-by-step guide},
  year     = {2020},
  note     = {PMID: 32279549},
  number   = {7},
  pages    = {431--435},
  volume   = {29},
  abstract = {Undertaking a literature search can be a daunting prospect. Breaking the exercise down into smaller steps will make the process more manageable. This article suggests 10 steps that will help readers complete this task, from identifying key concepts to choosing databases for the search and saving the results and search strategy. It discusses each of the steps in a little more detail, with examples and suggestions on where to get help. This structured approach will help readers obtain a more focused set of results and, ultimately, save time and effort.},
  doi      = {10.12968/bjon.2020.29.7.431},
}
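The structure of such an entry (entry type, citation key, then one `name = {value}` pair per field) can be illustrated with a minimal Python sketch. The function name `to_bibtex` is hypothetical and only serves this illustration. The sketch assumes that field values contain no unbalanced braces and does no escaping of special LaTeX characters.

```python
def to_bibtex(entry_type: str, key: str, fields: dict) -> str:
    """Format one reference as a BibTeX entry (illustrative sketch only)."""
    lines = [f"@{entry_type}{{{key},"]           # e.g. "@article{watson2020,"
    for name, value in fields.items():
        lines.append(f"  {name} = {{{value}}},")  # e.g. "  year = {2020},"
    lines.append("}")                             # closing brace of the entry
    return "\n".join(lines)


entry = to_bibtex(
    "article",
    "watson2020",
    {
        "author": "Watson, Mandy",
        "journal": "Br J Nurs",
        "year": "2020",
        "doi": "10.12968/bjon.2020.29.7.431",
    },
)
print(entry)
```

In practice the export function of a database or reference manager would be used instead of hand-written code.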

BibTeX is also the name of a software package used to typeset references in LaTeX. The references of this compendium are also managed and processed using the BibTeX format.

More information about BibTeX can be found at https://www.bibtex.org.

2.7 bibliographic database

Bibliographic databases comprise references to publications, such as articles in peer-reviewed journals, reports, patents, book chapters or conference proceedings.

As opposed to full text databases, bibliographic databases only provide bibliographic information or metadata. Bibliographic records typically include title, abstract, author(s), publication year, journal name, and the DOI or other persistent identifiers.

References in bibliographic databases are often indexed with subject headings to facilitate the retrieval of relevant records, for instance during a systematic literature search.

2.8 classification metrics

In information retrieval the performance of a systematic search, a search strategy or a search filter is described by certain metrics for binary classification tasks, such as accuracy, precision, sensitivity and specificity.

A classifier (the search) makes a prediction about the condition of a record (by retrieving or not retrieving the record). The classification (the search result) is evaluated by comparing the prediction with the actual condition (the relevance of the records).

In other words: The literature search is supposed to retrieve mostly relevant references and ignore non-relevant ones. A retrieved relevant record equals a true positive (tp), whereas a retrieved non-relevant record equals a false positive (fp). See Table 2.1 for reference.

Table 2.1: Truth table

record	relevant	irrelevant
retrieved	tp	fp
not retrieved	fn	tn

2.8.1 accuracy

Accuracy, also called fraction correct (FC), is a statistical measure of how well a binary classifier correctly (“true”) identifies a condition (“positive or negative”). It is defined as the ratio of all true classifications (true positives and true negatives) to the total number of classifications. (Haynes and Wilczynski (2004), Lefebvre et al. (2017), Fawcett (2006))

It can be calculated according to the following equation: \[ \text{accuracy} = \tfrac{tp + tn}{tp + fp + fn + tn} \]

2.8.2 precision

Precision, also called positive predictive value (PPV), is a performance metric for the retrieval of information. It is the fraction of all relevant records among all retrieved records, which can be written as:

\[ \text{precision} = \tfrac{tp}{tp + fp} \]

A high-precision search tries to retrieve as few non-relevant records as possible, usually missing out on relevant records.

2.8.3 NNR

The Number Needed to Read (NNR) is another common metric, defined as the number of records that need to be screened in order to find one relevant record. The NNR is simply the inverse of the precision:

\[ \text{NNR} = \tfrac{tp + fp}{tp} = \tfrac{1}{\text{precision}} \]

2.8.4 sensitivity

Sensitivity, also called true positive rate (TPR), recall or hit rate, is a performance metric for the retrieval of information, similar to the precision. It equals the probability with which relevant records are correctly identified. (See Haynes and Wilczynski (2004), Lefebvre et al. (2017))

It is the ratio of all relevant retrieved records to the total of all relevant records:

\[ \text{sensitivity} = \tfrac{tp}{tp + fn} \]

The idea of a sensitive search is to retrieve as many relevant records as possible, which results in retrieving more non-relevant records in the process.
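The four metrics of this section can be computed directly from the cells of the truth table. The following Python sketch does this for an invented search result. The counts are hypothetical and the function name `classification_metrics` is chosen for illustration only.

```python
def classification_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Compute accuracy, precision, NNR and sensitivity from the truth table.

    Assumes tp > 0, otherwise precision is zero and the NNR is undefined.
    """
    retrieved = tp + fp            # all records returned by the search
    relevant = tp + fn             # all relevant records in the database
    total = tp + fp + fn + tn      # every record in the database
    precision = tp / retrieved
    return {
        "accuracy": (tp + tn) / total,
        "precision": precision,
        "nnr": 1 / precision,
        "sensitivity": tp / relevant,
    }


# Hypothetical search: 40 relevant and 160 non-relevant records retrieved,
# 10 relevant records missed, 790 non-relevant records correctly ignored.
metrics = classification_metrics(tp=40, fp=160, fn=10, tn=790)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

In this invented case the search finds 80 % of the relevant records (sensitivity 0.8), but five records have to be screened for each relevant one (precision 0.2, NNR 5).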

2.9 CSV

Comma-separated values (CSV) is a plain text data format to store tabular information, such as bibliographic records. CSV is described in detail in RFC 4180.

A CSV file consists of multiple lines of comma-separated data fields. The first line can be used as a header line listing the names of all columns. Each of the remaining lines contains one record with its field values separated by commas. To be interpreted correctly, the fields must keep the same order on all lines, and empty fields must not be omitted.

Example

CSV in plain text

Title,Year,Volume,Issue,Start Page,End Page
Design Automation and Implementation of Machine Learning Classifier Chips,2020,8,,192155,192164
Performance Verification of a Target Tracking System With a Laser Rangefinder,2021,9,,30993,31009
Signature-Coordinated Digital Multirelay Protection for Microgrid Systems,2014,29,9,4614,4623
Knowledge Graph-Enabled Cancer Data Analytics,2020,24,7,1952,1967
Failure Analysis of Metal Oxide Arresters under Harmonic Distortion,2016,107,3,167,176
Novel Data-Driven Geolocation Approach for Detecting Smuggled Internet Traffic,2025,13,,36306,36320

CSV displayed as a table

Title	Year	Volume	Issue	Start Page	End Page
Design Automation and Implementation of Machine Learning Classifier Chips	2020	8		192155	192164
Performance Verification of a Target Tracking System With a Laser Rangefinder	2021	9		30993	31009
Signature-Coordinated Digital Multirelay Protection for Microgrid Systems	2014	29	9	4614	4623
Knowledge Graph-Enabled Cancer Data Analytics	2020	24	7	1952	1967
Failure Analysis of Metal Oxide Arresters under Harmonic Distortion	2016	107	3	167	176
Novel Data-Driven Geolocation Approach for Detecting Smuggled Internet Traffic	2025	13		36306	36320
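As a minimal sketch of how such a file can be read programmatically, the following Python example uses the `csv` module of the standard library. Two records of the example are embedded as a string so that the code runs without a file on disk; the variable names are chosen for illustration.

```python
import csv
import io

# Two records from the example above, including the header line.
CSV_TEXT = """Title,Year,Volume,Issue,Start Page,End Page
Knowledge Graph-Enabled Cancer Data Analytics,2020,24,7,1952,1967
Design Automation and Implementation of Machine Learning Classifier Chips,2020,8,,192155,192164
"""

# DictReader takes the first line as header and maps every following line
# to a dictionary keyed by column name.
records = list(csv.DictReader(io.StringIO(CSV_TEXT)))

for record in records:
    issue = record["Issue"] or "(no issue)"   # an empty field arrives as ""
    print(record["Title"], record["Year"], issue)
```

The empty Issue field of the second record is kept as an empty string, which is why empty fields must not be omitted from the file.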

2.10 data field

A data field (also called field or column) is a set of values of a particular data type within a database. For instance the data fields for the author or the title contain text strings whereas the fields for the issue, volume or PubMed ID contain numerical values.

Data fields possess designations in the form of field codes or search field tags such as PMID, AU, TI and AB for the fields of unique identifier, author, title and abstract.

Table 2.2: Data fields PMID, AU, TI shown for three records

PMID	AU	TI
7616995	J. P. Kassirer, M. Angell	Redundant publication: a reminder
16040884	A. K. Akobeng	Principles of evidence based medicine
22071866	T. Young, S. Hopewell	Methods for obtaining unpublished data

It always depends on the syntax of the database or search interface which fields can be searched and by what code they are searchable.

2.11 dataset

Datasets or records of a database are collections of data. In a tabular database they correspond to the rows of the table, as shown in Table 2.2. A record in such a database consists of values for the given columns or data fields of the table.
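The relation between records (rows) and data fields (columns) can be sketched in Python using the three records of Table 2.2. The representation as a list of dictionaries is an assumption made for illustration and not the internal format of any particular database.

```python
# Each dictionary is one record (row); each key is one data field (column).
records = [
    {"PMID": 7616995, "AU": "J. P. Kassirer, M. Angell",
     "TI": "Redundant publication: a reminder"},
    {"PMID": 16040884, "AU": "A. K. Akobeng",
     "TI": "Principles of evidence based medicine"},
    {"PMID": 22071866, "AU": "T. Young, S. Hopewell",
     "TI": "Methods for obtaining unpublished data"},
]

# A data field is a column: the same key taken from every record.
titles = [record["TI"] for record in records]

# A record is a row: all field values that belong to one publication.
by_pmid = {record["PMID"]: record for record in records}

print(titles)
print(by_pmid[16040884]["AU"])
```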

Records of bibliographic databases are called references. These contain the metadata referring to a publication, such as title, abstract, authors, journal name or publication date.

Datasets within clinical trials registries contain metadata for clinical trials, such as registration number, study type, research institution, study status, etc.

Records within fulltext databases also feature a fulltext document, as opposed to bibliographic databases.

2.12 digital object identifier

A digital object identifier (DOI) is a persistent identifier issued by the International DOI Foundation. It is used to uniquely identify publications.

DOIs take the form of character strings which consist of a prefix and a suffix, separated by a slash /. The prefix identifies the registrant of the DOI (usually the publisher of an article) and takes the form 10.xxxx, where xxxx is a number greater than or equal to 1000. After the prefix and the slash follows the suffix, which is chosen by the registrant for the particular digital object.

DOIs can be resolved using the website of the International DOI Foundation or the Handle.Net Registry.

Example

doi:10.1000/182 can be resolved via https://doi.org/10.1000/182 or https://hdl.handle.net/10.1000/182 and leads to the DOI handbook.
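The structure described above (prefix, slash, suffix) can be checked with a short Python sketch. The regular expression is a simplification based only on the description in this section and is not an official validation rule; the function names `split_doi` and `resolver_url` are hypothetical. Suffixes containing special characters may additionally need URL encoding, which is not handled here.

```python
import re

# Prefix "10." followed by a number >= 1000, a slash, and a non-empty suffix.
DOI_PATTERN = re.compile(r"^(10\.[1-9][0-9]{3,})/(\S+)$")


def split_doi(doi: str) -> tuple[str, str]:
    """Return (prefix, suffix) of a DOI, dropping an optional 'doi:' label."""
    cleaned = doi.strip()
    if cleaned.lower().startswith("doi:"):
        cleaned = cleaned[4:].strip()
    match = DOI_PATTERN.match(cleaned)
    if match is None:
        raise ValueError(f"not of the form 10.xxxx/suffix: {doi!r}")
    return match.group(1), match.group(2)


def resolver_url(doi: str) -> str:
    """Build the doi.org address under which the DOI can be resolved."""
    prefix, suffix = split_doi(doi)
    return f"https://doi.org/{prefix}/{suffix}"


print(split_doi("doi:10.1000/182"))
print(resolver_url("10.12968/bjon.2020.29.7.431"))
```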

2.13 eligibility criteria

A very important step at the beginning of any review project is the definition of certain eligibility criteria on the basis of the research question. There are two types of criteria:

Inclusion criteria must be met by studies in order for them to be included in the review. In contrast, if a study meets one or more of the exclusion criteria, it will be excluded from the review. See also McKenzie et al. (2022), Gough et al. (2017).

Example: systematic review about chronic non-cancer pain

Inclusion criteria:
- adults (18 years or older)
- patients with chronic non-cancer pain
- randomized controlled trials
Exclusion criteria:
- acute pain, post-surgical pain
- chronic cancer pain
- pregnancy

2.14 evidence

Scientific evidence or simply evidence is information obtained by conducting experiments or by analyzing empirical data in accordance with the scientific method. Scientific evidence is used to support or disprove scientific hypotheses and in consequence to inform evidence-based decision-making.

2.15 evidence-based medicine

Evidence-based medicine (EBM) is commonly understood as the “conscientious, explicit, and judicious use of current best evidence in making decisions about the care of individual patients. The practice of evidence based medicine means integrating individual clinical expertise with the best available external clinical evidence from systematic research.” (Sackett et al. (1996))

2.15.1 types of evidence

There are various types of research with a varying degree of evidence. This is often represented in the form of a so-called evidence pyramid (see Figure 2.2).

The pyramid shape suggests the availability of a high amount of fundamental research and personal experience as a foundation for scientific studies with an increasing level of evidence. The very top of the pyramid encompasses various types of evidence synthesis, i.e. various kinds of (systematic) reviews (see Section 5.1) which summarize primary studies.

2.16 FAIR

FAIR is an acronym for the four principles Findability, Accessibility, Interoperability and Reusability, which serve as a guideline for the management of scientific or scholarly data. A consortium of scientists and organizations defined these FAIR principles in order to advance the machine-actionability of data. (Wilkinson et al. (2016))

Data that meet these principles are called FAIR Data.

The FAIR Principles

Short description, adapted from PUBLISSO
Findability	Data should be provided with sufficient metadata, such as title, authors, summary, and information about the origin of the data. Moreover, a globally unique and persistent identifier, such as a digital object identifier (DOI), should be assigned to the data.
Accessibility	Data and their metadata should be made long-term accessible through standardized communication protocols, such as https.
Interoperability	Controlled vocabulary (thesauri) such as Medical Subject Headings (MeSH) or formats for metadata such as XML, which can be read by both humans and machines, allow for the creation of interoperable metadata and links between datasets.
Reusability	Reusability depends on the quality of the metadata, a proper description of the provenance of the research data and its citability (for instance by using DOIs) under clearly stated license conditions.

2.17 full-text

The complete texts of publications (e.g. articles, books, chapters, reports, …) are called full-texts.

Most of the available databases for literature searching are bibliographic databases, which means they do not contain the full-text, but bibliographic references to the publications, in most cases including a link to the publisher, where the full-text can be obtained.

Open Access publications can be accessed freely, whereas non-open access publications usually require a paid subscription or access fee. Alternatively, they can be ordered using the document delivery service of a university library.

In some cases, an article is published in more than one place at once. There are full-text databases, such as PubMed Central, which provide access to a full text article parallel to the publisher. For example, the guideline PRISMA-S by Rethlefsen et al. (2021) is available via the journal website or as a PubMed Central record.

full-text retrieval tools

There are tools that can help to identify the shortest path to the full-text:

Reference management programs (e.g. EndNote) also often provide ways to retrieve available full-texts for the managed references. Sometimes this automated retrieval fails due to incompatibility or security measures of the publishers, even when the full-text would otherwise be accessible.

In case you are affiliated with a university or similar institution, it is always a good idea to consult your institution’s library catalog for all options available to you for accessing a certain publication.

2.18 indexing

Indexing is the process in which index terms are assigned to records. This is done by the database provider in order to indicate what the referenced document is about, independent of its explicit title or abstract. In other words, an indexed record can be retrieved systematically based on its contextual meaning and implicit contents, rather than by searching verbatim expressions in the text.

2.19 orphan line

All parts of a search strategy are supposed to contribute to the overall result of the database search. In case a search query within a search strategy is not connected (using operators) to the rest of the search strategy, it is called an orphan line.

Example

1. hypertension/
2. (hypertension or high blood pressure).ti,ab.
3. *patient attitude/
4. *patient satisfaction/
5. (choice$ or empower$).ti.
6. 1 or 2
7. 3 or 4
8. 6 and 7

Line 5 is an orphan line.
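Orphan lines can also be detected automatically. The following Python sketch starts at the final line of a strategy and follows all line numbers referenced by combination lines; every line that is never reached is reported as an orphan. It assumes that combination lines consist only of line numbers joined by and, or, or not, and that all referenced line numbers exist. Range notations are not handled. The function name `find_orphan_lines` is hypothetical.

```python
import re

# A combination line such as "1 or 2": only line numbers joined by operators.
COMBINATION = re.compile(r"^\d+(\s+(and|or|not)\s+\d+)*$", re.IGNORECASE)


def find_orphan_lines(strategy: list[str]) -> list[int]:
    """Return the 1-based numbers of lines that never feed into the last line."""
    used = set()
    pending = [len(strategy)]            # start from the final result line
    while pending:
        number = pending.pop()
        if number in used:
            continue
        used.add(number)
        line = strategy[number - 1].strip()
        if COMBINATION.match(line):
            # follow every line number referenced by this combination
            pending.extend(int(n) for n in re.findall(r"\d+", line))
    return [n for n in range(1, len(strategy) + 1) if n not in used]


strategy = [
    "hypertension/",
    "(hypertension or high blood pressure).ti,ab.",
    "*patient attitude/",
    "*patient satisfaction/",
    "(choice$ or empower$).ti.",
    "1 or 2",
    "3 or 4",
    "6 and 7",
]
print(find_orphan_lines(strategy))  # [5]
```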

2.20 plain text

Plain text (as opposed to styled or rich text) is digital text without applied styling information, such as fonts, font styles, font sizes, colors, images or hyperlinks (see also the Unicode definition).

Plain text is normally used when formatting is not important, such as when writing code.

Free plain text editors

Editor	available for
vim	Windows, Unix, macOS, iOS, Android
emacs	Windows, Linux, macOS
nano	Windows, Linux, macOS
Notepad++	Windows

2.21 PMID and PMCID

The PubMed ID (PMID) and the PubMed Central ID (PMCID) are unique identifiers assigned to the records within the databases PubMed and PubMed Central. They are similar to the digital object identifier (DOI).

PMIDs are unique integer values, e.g. 32256971. PMCIDs are composed of the prefix PMC followed by a series of digits, e.g. PMC7106990.

PubMed records can easily be found simply by entering their PMIDs as search terms into the PubMed search.
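Because the three identifier types differ in form, they can be told apart by simple pattern matching. The following Python sketch illustrates this; the patterns are simplifications derived from the descriptions in this compendium and the function name `identifier_type` is hypothetical. The sketch only inspects the form of the string and does not check whether a record with this identifier exists.

```python
import re


def identifier_type(value: str) -> str:
    """Guess whether a string looks like a PMID, a PMCID or a DOI."""
    value = value.strip()
    if re.fullmatch(r"[0-9]+", value):
        return "PMID"       # integer only, e.g. 32256971
    if re.fullmatch(r"PMC[0-9]+", value):
        return "PMCID"      # prefix PMC plus digits, e.g. PMC7106990
    if re.fullmatch(r"10\.[0-9]{4,}/\S+", value):
        return "DOI"        # prefix 10.xxxx, slash, suffix
    return "unknown"


for example in ["32256971", "PMC7106990", "10.1000/182", "n/a"]:
    print(example, identifier_type(example))
```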

Conversion tool

The National Library of Medicine provides a tool for the conversion of the PMID, PMCID and DOI into one another. This tool only works for records which are both part of PubMed and PubMed Central.

2.22 RIS

RIS is a bibliographic format which is commonly used by reference management programs, review tools and bibliographic databases. It was developed by Research Information Systems, Inc. (which was acquired by a Thomson Reuters division, which is now Clarivate).

A RIS file consists of lines of plain text, one line for each data field. Each line begins with a field tag, which consists of either two upper-case letters or an upper-case letter and a single digit, followed by two spaces, a hyphen and another space. After the tag the content of the field is written, followed by a carriage return and line feed. RIS files have the file extension .ris.

Each reference within a RIS file must begin with a TY line, which states the type of reference, and it must end with an ER line, which marks the end of the reference. (See archived RIS Format Specifications)

Example: RIS format

TY  - JOUR
T1  - How to undertake a literature search: a step-by-step guide
AU  - Watson, Mandy
Y1  - 2020/04/09
PY  - 2020
DA  - 2020/04/09
N1  - doi: 10.12968/bjon.2020.29.7.431
DO  - 10.12968/bjon.2020.29.7.431
T2  - British Journal of Nursing
JF  - British Journal of Nursing
JO  - Br J Nurs
SP  - 431
EP  - 435
VL  - 29
IS  - 7
PB  - Mark Allen Group
N2  - Undertaking a literature search can be a daunting prospect. Breaking the exercise down into smaller steps will make the process more manageable. This article suggests 10 steps that will help readers complete this task, from identifying key concepts to choosing databases for the search and saving the results and search strategy. It discusses each of the steps in a little more detail, with examples and suggestions on where to get help. This structured approach will help readers obtain a more focused set of results and, ultimately, save time and effort.
AB  - Undertaking a literature search can be a daunting prospect. Breaking the exercise down into smaller steps will make the process more manageable. This article suggests 10 steps that will help readers complete this task, from identifying key concepts to choosing databases for the search and saving the results and search strategy. It discusses each of the steps in a little more detail, with examples and suggestions on where to get help. This structured approach will help readers obtain a more focused set of results and, ultimately, save time and effort.
SN  - 0966-0461
M3  - doi: 10.12968/bjon.2020.29.7.431
UR  - https://doi.org/10.12968/bjon.2020.29.7.431
Y2  - 2025/07/09
ER  -
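The line structure described above makes RIS easy to process. The following Python sketch splits RIS text into references and collects the values per field tag. It is a simplified illustration and not a complete implementation of the specification: lines that do not start with a tag (for example wrapped continuation lines) are ignored. The function name `parse_ris` is hypothetical.

```python
import re

# Field tag (two characters), two spaces, hyphen, optional space and content.
TAG_LINE = re.compile(r"^([A-Z][A-Z0-9])  -(?: (.*))?$")


def parse_ris(text: str) -> list[dict[str, list[str]]]:
    """Split RIS text into references; each maps a tag to a list of values."""
    references = []
    current = None
    for line in text.splitlines():
        match = TAG_LINE.match(line.rstrip())
        if match is None:
            continue                       # blank or malformed line
        tag, value = match.group(1), match.group(2) or ""
        if tag == "TY":
            current = {"TY": [value]}      # a new reference starts
        elif tag == "ER":
            if current is not None:
                references.append(current)
            current = None                 # the reference is complete
        elif current is not None:
            # tags such as AU may occur several times, so values are lists
            current.setdefault(tag, []).append(value)
    return references


RIS_TEXT = """TY  - JOUR
T1  - How to undertake a literature search: a step-by-step guide
AU  - Watson, Mandy
PY  - 2020
SP  - 431
EP  - 435
ER  -
"""

for reference in parse_ris(RIS_TEXT):
    print(reference["AU"], reference["PY"], reference["T1"])
```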

2.23 retractions

The review of manuscripts by fellow scientists (aka peer review) as part of the publication process is supposed to protect the scientific community from fraud, detect errors or false conclusions and in doing so uphold a certain level of quality. This, however, is not always enough.

In cases where serious errors or even scientific misconduct are detected only after an article is published, it may get retracted. The retraction can be initiated by the authors themselves, their affiliated institutions or the journal editors. On the publisher’s website and within bibliographic databases, such an article usually gets flagged as being retracted and a retraction notice is published, which explains the cause for the retraction.

Retraction notice vs. retracted publication

In bibliographic databases, such as PubMed, retracted publications and the retraction notices are separate records, which can be searched individually.

Retraction Watch

The blog Retraction Watch, run by the non-profit organization Center for Scientific Integrity, keeps an eye on current retractions and reports on developments in this area.

2.24 seed paper

A systematic literature search often begins with a quick scoping search for a handful of publications that provide an answer to the research question. These publications are called seed papers, key papers or core papers.

Purposes of seed papers

Apart from making oneself familiar with the topic by reading them, these seed papers can be put to use in:

Extraction of index terms and free-text expressions.
Citation searching.
Testing search strategies.

Personal note

In the experience of the author of this compendium, seed papers can be further divided into two types:

1. Publications which answer the research question and which can be used as a template for similar literature which should be picked up by a systematic search, because they might be included in the review.
2. Publications which do not fully cover all the aspects of the question, but provide background information or search terms for at least one of the main concepts. These do not qualify as included studies for the review project. Examples are review articles which are similar (but not identical) to what the researchers have in mind for the present project.

Often, researchers are not aware of this distinction and its implications for the systematic literature search.

Ideally, the systematic search should retrieve all of the type 1 seed papers, whereas type 2 papers may or may not be found by the search. Type 2 papers are useful for preparation, citation searching (e.g. for relevant studies of previous systematic reviews) or to learn important search terms from previous search strategies.

2.25 syntax

Similar to programming, the syntax is the set of rules that applies for setting up search queries and building search strategies in databases. It defines the operators, field codes and special characters (such as wildcard symbols, parentheses, slashes, quotation marks, etc.) that are available within a particular search interface.

As a consequence, search strategies cannot be used freely in every database. They have to be translated due to different syntax and due to different index terms. (See Clark et al. (2020), Glanville et al. (2019), Wanner and Baumann (2019), Damarell et al. (2013)).

Table 2.3: Examples for syntax in different search interfaces

PubMed	`"ocular hypertension"[tiab]`
Embase	`'ocular hypertension':ti,ab`
Ovid	`"ocular hypertension".ti,ab.`
Cochrane Library	`"ocular hypertension":ti,ab`
Scopus	`TITLE-ABS({ocular hypertension})`
Web of Science	`TI=("ocular hypertension") OR AB=("ocular hypertension")`
EBSCOhost	`(TI "ocular hypertension") OR (AB "ocular hypertension")`
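The differences in Table 2.3 can be expressed as one template per search interface. The following Python sketch does this for a phrase search in title and abstract only; the templates are copied from the table and the function name `title_abstract_phrase` is hypothetical. It does not translate complete search strategies or index terms.

```python
# Templates taken from Table 2.3; "{phrase}" marks the position of the phrase.
# The doubled braces in the Scopus template produce literal { and }.
TEMPLATES = {
    "PubMed": '"{phrase}"[tiab]',
    "Embase": "'{phrase}':ti,ab",
    "Ovid": '"{phrase}".ti,ab.',
    "Cochrane Library": '"{phrase}":ti,ab',
    "Scopus": "TITLE-ABS({{{phrase}}})",
    "Web of Science": 'TI=("{phrase}") OR AB=("{phrase}")',
    "EBSCOhost": '(TI "{phrase}") OR (AB "{phrase}")',
}


def title_abstract_phrase(phrase: str, interface: str) -> str:
    """Write a title/abstract phrase search in the syntax of one interface."""
    return TEMPLATES[interface].format(phrase=phrase)


for interface in TEMPLATES:
    print(interface, title_abstract_phrase("ocular hypertension", interface))
```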

2.25.1 syntax errors

The more complex a search strategy gets, the more likely mistakes become during its development or its translation between databases.

Common sources of error

Order of searching terms

In PubMed and Ovid all search queries are processed from left to right. In contrast, the Cochrane Library applies the search operators in the order NOT-AND-OR, whereas in Scopus the order is OR-AND-NOT. The easiest way to avoid unintended results is the use of nesting, i.e. grouping the search terms explicitly with parentheses.
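The effect of the processing order can be illustrated with sets in Python, where `|` stands for OR and `&` for AND. The record numbers are invented for this sketch. The query as typed is a OR b AND c; the parentheses make the two possible readings explicit.

```python
# Hypothetical record numbers retrieved by three search terms.
a = {1, 2, 3}
b = {3, 4}
c = {4, 5}

# Reading 1: operators applied in the order written (left to right).
left_to_right = (a | b) & c

# Reading 2: AND is applied before OR.
and_before_or = a | (b & c)

print(sorted(left_to_right))   # [4]
print(sorted(and_before_or))   # [1, 2, 3, 4]
```

The same unnested query therefore returns one record under the first reading and four under the second, which is why explicit nesting is the safer choice.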

Truncated phrases

In some search interfaces the application of truncated search terms within a phrase does not work (properly). For example, in the Cochrane Library the phrase "olfact* system" yields no results and leads to an error message. Instead the query needs to be written as olfact* NEXT system.

Straight quotes vs. curly quotes

Some search interfaces, such as Ovid, only recognize straight quotation marks " " for the use around phrases. However, today’s word processors often automatically transform these into typographically correct „ “ or “ ”, called curly or smart quotation marks. It is possible to deactivate certain autocorrection features to avoid this behavior, otherwise it might be a good idea to work with a plain text editor.
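If a search strategy has already been pasted from a word processor, the curly quotation marks can be converted back. The following Python sketch replaces the three double quotation marks mentioned above with straight ones; the function name `straighten_quotes` is hypothetical, and single quotation marks are deliberately left untouched.

```python
# Map typographic double quotation marks to the straight quotation mark.
CURLY_TO_STRAIGHT = str.maketrans({
    "\u201c": '"',   # left double quotation mark
    "\u201d": '"',   # right double quotation mark
    "\u201e": '"',   # low double quotation mark (German opening quote)
})


def straighten_quotes(query: str) -> str:
    """Replace curly double quotation marks by straight ones."""
    return query.translate(CURLY_TO_STRAIGHT)


pasted = "\u201cocular hypertension\u201d.ti,ab."
print(straighten_quotes(pasted))
```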

2.26 thesaurus

A thesaurus (Ancient Greek: θησαυρός (thesaurós) ‘treasury’) is a dictionary of synonyms, which are often ordered alphabetically or hierarchically.

In the context of literature searching, a thesaurus contains a database-specific controlled vocabulary of index terms.

Table 2.4: Commonly known databases and their thesauri

Database	Thesaurus
PubMed/MEDLINE	Medical Subject Headings (MeSH)
Embase	Emtree
Cochrane Library	Medical Subject Headings (MeSH)
CINAHL	CINAHL Subject Headings
APA PsycInfo	Thesaurus of Psychological Index Terms
Global Health	CABI Thesaurus
ERIC	ERIC Thesaurus

These vocabularies usually list preferred terms for indexing, their definitions as well as lists of synonyms for each of those index terms. The index terms are arranged hierarchically, ranging from very broad categories to very specific terms.

MeSH hierarchy from the top of the MeSH tree to Pleural Cavity

All MeSH Categories
  Anatomy Category
    Body Regions
      Torso
        Thorax
          Thoracic Cavity
            Pleural Cavity

The controlled vocabularies are regularly updated, new index terms are introduced or hierarchies rearranged. (See What’s New in MeSH for example).

2.27 vocabulary mismatch

Vocabulary mismatch occurs when an object or circumstance is called or described by different individuals using different expressions, which means the vocabularies of these individuals do not match. This is a well-known phenomenon in information retrieval. (Furnas et al. (1987))

For instance, researchers attempt to retrieve literature using search terms which seem relevant to them or are commonly used in their field, whereas the authors (perhaps from a different special field) might use different (or sometimes unspecific or uncommon) expressions to describe the same subject.

Example

A research group is interested in hypertension, which is why they use the search term hypertension in their free text search. However, some of the studies relevant to their research question mention

… patients with 140/90 mmHg and above.

… participants with an elevated bp.

without the occurrence of the word hypertension in title or abstract. Without additional search terms these publications are not retrieved in the search.

In order to mitigate this problem systematic literature searches usually employ synonyms, truncation, index terms, and proximity operators, or are supplemented by non-Boolean techniques, such as citation searching.
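The hypertension example can be reproduced with a small Python sketch. The three abstract snippets are invented around the phrases quoted above, and the simple substring matching is an assumption made for illustration; real databases match whole words or phrases and offer further options.

```python
abstracts = [
    "A trial of lifestyle advice in adults with hypertension.",
    "We enrolled patients with 140/90 mmHg and above.",
    "Outcomes in participants with an elevated bp.",
]


def free_text_search(records: list[str], terms: list[str]) -> list[str]:
    """Return all records that contain at least one of the search terms."""
    return [
        record for record in records
        if any(term.lower() in record.lower() for term in terms)
    ]


narrow = free_text_search(abstracts, ["hypertension"])
broad = free_text_search(abstracts, ["hypertension", "mmHg", "elevated bp"])
print(len(narrow), len(broad))   # 1 3
```

With the single term only one of the three relevant records is found; adding alternative expressions retrieves all three.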

2.28 wildcard

Wildcards are special characters used for truncation. The usage and meaning of the available wildcards depend on the syntax of the database or search interface.

Table 2.5: Commonly used wildcard symbols

character	name
`*`	asterisk
`$`	dollar sign
`?`	question mark
`#`	hash sign, pound sign

Table 2.6: Wildcard symbols and the number of characters n they are allowed to replace in different search interfaces

	unlimited (0–∞)	mandatory (1)	optional (0–1)
PubMed	`*`
Ovid	`$` or `*`	`#`	`?`
Cochrane	`*`		`?`
Embase	`*`	`?`	`$`
Scopus	`*`	`?`
Web of Science	`*`	`$`	`?`
EBSCOhost	`*`	`?`	`#`
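The meaning of the three wildcard types can be illustrated by translating them into regular expressions. The following Python sketch does this for the Ovid row of Table 2.6. It only mimics the matching of single words for illustration and does not reproduce the behavior of the actual search interface; the function names are hypothetical.

```python
import re

# Ovid row of Table 2.6, expressed as regular-expression fragments.
OVID_WILDCARDS = {
    "$": r"\w*",   # unlimited: zero or more characters
    "*": r"\w*",   # unlimited: zero or more characters
    "#": r"\w",    # mandatory: exactly one character
    "?": r"\w?",   # optional: zero or one character
}


def ovid_term_to_regex(term: str) -> re.Pattern:
    """Translate a single search term with Ovid wildcards into a regex."""
    parts = [OVID_WILDCARDS.get(char, re.escape(char)) for char in term]
    return re.compile("".join(parts), re.IGNORECASE)


def matches(term: str, word: str) -> bool:
    """Check whether the whole word is matched by the truncated term."""
    return ovid_term_to_regex(term).fullmatch(word) is not None


print(matches("empower$", "empowerment"))                        # True
print(matches("wom#n", "women"), matches("wom#n", "womn"))       # True False
print(matches("colo?r", "color"), matches("colo?r", "colour"))   # True True
```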

Boutron, I., M. J. Page, J. P. T. Higgins, D. G. Altman, A. Lundh, and A. Hróbjartsson. 2022. “Chapter 7: Considering Bias and Conflicts of Interest Among the Included Studies.” In Cochrane Handbook for Systematic Reviews of Interventions, edited by J. P. T. Higgins, J. Thomas, J. Chandler, et al. Cochrane.

Braun, Cordula, Christine Schmucker, Monika Nothacker, et al. 2021. Manual Bewertung des Biasrisikos in Interventionsstudien. Albert-Ludwigs-Universität Freiburg. https://doi.org/10.6094/UNIFR/194900.

Cals, Jochen W. L., and Daniel Kotz. 2013. “Effective writing and publishing scientific papers, part II: title and abstract.” J Clin Epidemiol 66 (6): 585. https://doi.org/10.1016/j.jclinepi.2013.01.005.

Clark, Justin Michael, Sharon Sanders, Matthew Carter, et al. 2020. “Improving the translation of search strategies using the Polyglot Search Translator: a randomized controlled trial.” J Med Libr Assoc 108 (2): 195–207. https://doi.org/10.5195/jmla.2020.834.

Damarell, Raechel A., Jennifer J. Tieman, and Ruth M. Sladek. 2013. “OvidSP Medline-to-PubMed search filter translation: a methodology for extending search filter range to include PubMed’s unique content.” BMC Med Res Methodol 13 (1): 86. https://doi.org/10.1186/1471-2288-13-86.

Fawcett, Tom. 2006. “An Introduction to ROC Analysis.” Pattern Recognit Lett 27 (8): 861–74. https://doi.org/10.1016/j.patrec.2005.10.010.

Furnas, G. W., T. K. Landauer, L. M. Gomez, and S. T. Dumais. 1987. “The Vocabulary Problem in Human-System Communication.” Commun ACM 30 (11): 964–71. https://doi.org/10.1145/32206.32212.

Glanville, Julie, Ruth Foxlee, Susi Wisniewski, Anna Noel-Storr, Mary Edwards, and Gordon Dooley. 2019. “Translating the Cochrane EMBASE RCT filter from the Ovid interface to Embase.com: a case study.” Health Info Libr J 36 (3): 264–77. https://doi.org/10.1111/hir.12269.

Gough, David, Sandy Oliver, and James Thomas. 2017. An Introduction to Systematic Reviews. 2nd ed. SAGE Publications.

Haynes, R. Brian, and Nancy L. Wilczynski. 2004. “Optimal search strategies for retrieving scientifically strong studies of diagnosis from Medline: analytical survey.” BMJ 328 (7447): 1040. https://doi.org/10.1136/bmj.38068.557998.EE.

Lefebvre, Carol, Julie Glanville, Sophie Beale, et al. 2017. “Assessing the Performance of Methodological Search Filters to Improve the Efficiency of Evidence Information Retrieval: Five Literature Reviews and a Qualitative Study.” Health Technol Assess 21 (69): 1–148. https://doi.org/10.3310/hta21690.

McKenzie, J. E., S. E. Brennan, R. E. Ryan, H. J. Thomson, R. V. Johnston, and J. Thomas. 2022. “Chapter 3: Defining the Criteria for Including Studies and How They Will Be Grouped for the Synthesis.” In Cochrane Handbook for Systematic Reviews of Interventions, edited by J. P. T. Higgins, J. Thomas, J. Chandler, et al. Cochrane.

Moher, D., B. Pham, M. L. Lawson, and T. P. Klassen. 2003. “The inclusion of reports of randomised trials published in languages other than English in systematic reviews.” Health Technol Assess 7: 1–90. https://doi.org/10.3310/hta7410.

Morrison, Andra, Julie Polisena, Don Husereau, et al. 2012. “The effect of English-language restriction on systematic review-based meta-analyses: a systematic review of empirical studies.” Int J Technol Assess Health Care 28 (2): 138–44. https://doi.org/10.1017/S0266462312000086.

Névéol, Aurélie, Rezarta Islamaj Doğan, and Zhiyong Lu. 2010. “Author Keywords in Biomedical Journal Articles.” AMIA Annu Symp Proc 2010 (November): 537–41. https://pubmed.ncbi.nlm.nih.gov/21347036/.

Odgaard‐Jensen, Jan, Gunn E. Vist, Antje Timmer, et al. 2011. “Randomisation to Protect Against Selection Bias in Healthcare Trials.” Cochrane Database Syst Rev (4). https://doi.org/10.1002/14651858.MR000012.pub3.

Pitkin, Roy M., and Mary Ann Branagan. 1998. “Can the Accuracy of Abstracts Be Improved by Providing Specific Instructions? A Randomized Controlled Trial.” JAMA 280 (3): 267–69. https://doi.org/10.1001/jama.280.3.267.

Rethlefsen, Melissa L., Shona Kirtley, Siw Waffenschmidt, et al. 2021. “PRISMA-S: an extension to the PRISMA Statement for Reporting Literature Searches in Systematic Reviews.” Syst Rev 10 (1): 39. https://doi.org/10.1186/s13643-020-01542-z.

Sackett, David L., William M. C. Rosenberg, J. A. Muir Gray, R. Brian Haynes, and W. Scott Richardson. 1996. “Evidence Based Medicine: What It Is and What It Isn’t.” BMJ 312 (7023): 71–72. https://doi.org/10.1136/bmj.312.7023.71.

Wanner, Amanda, and Niki Baumann. 2019. “Design and implementation of a tool for conversion of search strategies between PubMed and Ovid MEDLINE.” Res Syn Meth 10 (2): 154–60. https://doi.org/10.1002/jrsm.1314.

Wilkinson, Mark D., Michel Dumontier, IJsbrand Jan Aalbersberg, et al. 2016. “The FAIR Guiding Principles for scientific data management and stewardship.” Sci Data 3 (March): 160018. https://doi.org/10.1038/sdata.2016.18.