How to search on EGA

Tip 1: Use simple and clear keywords

No matter what you are seeking: a study, a dataset or a committee; the search will look recursively in all the fields of all the entities in the database. Thus, there is no need to specify which type of data you are looking for. Try to use single and specific words and avoid long and complex expressions.

Try it out: cancer rna-seq instead of cancer studies using rna-seq


Tip 2: Narrow your result using search operators

Search operators give you more control over the results. They manage the way keywords are used to restrict the search. By default, the search engine applies an OR operator when a space is inserted between two words. Include AND (capitalised) between the keywords in your search in order to show only pages containing both term.

Try it out: Search all the studies based on transcriptomic data, including those ones that use RNA sequencing to obtain this data


Tip 3: Do not worry about details

The search engine automatically checks for the most common spelling mistakes of a given word and suggest you to use the correct version. It also ignores capital letters and most punctuation so none of these mistakes will hamper the accuracy of your search. Besides, when using a combination of words, the search engine will suggest similar combinations with a higher number of results.


Tip 4: Use words that are likely to appear within the text

Unfortunately, our search engine is not able to interpret words yet. Therefore, words which do not appear in the body text will not return any result despite they denote a group of present terms. For instance, if we are looking for studies focused on neurological diseases, using neurological as keyword would not be the best option. Instead, you should use the name of each neurological disease such as bipolar disorder, squizophrenia or depression.

In the same fashion, incomplete words do not return any result. Please include asterisk before and/or after the incomplete word in order to allow for partial matching.

Try it out: canc* instead of canc (without wildcard). Also you can try out *oma, or even *geneti*