WikiPathways: A Wikipedia for biological pathways

Since its inception in 2008, WikiPathways, a wiki for biological pathways, has grown to include over 2,800 individual pathways. It currently has over 700 active users and has been cited more than 500 times in the scientific literature.

What is WikiPathways?

WikiPathways is a structured knowledge base for biological pathways that is collaboratively edited by the scientific community. Like Wikipedia, it allows anyone to create or edit records, all of which are moderated and curated by other WikiPathways users.

Each WikiPathways entry combines genes, proteins, metabolites, and interactions involved in a single pathway or process. All elements of the pathway have links to corresponding entries in reference databases such as UniProt and PubChem, making WikiPathways an important crystallisation point for integrating knowledge about biological pathways.

Applications of WikiPathways

A large biological pathway database, WikiPathways is used for a variety of applications including omics and gene expression analyses.

The dataset can also be used to perform other types of data analysis, visualise research data in the context of pathways, and build pathway models. Moreover, the WikiPathways website itself is a useful resource for learning about pathways.

Accessing WikiPathways data

The primary way to access WikiPathways is via wikipathways.org, the official website of the project. The data can also be accessed programmatically via the web service API.

WikiPathways entries are also referenced in Wikipedia, Wikidata, and other open datasets.
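
As a rough illustration of programmatic access through the web service API, the sketch below only builds a request URL and does not send it. The base URL, the getPathwayInfo method name, the pwId and format parameters, and the WP554 identifier are all assumptions that do not come from this article, so check them against the current API documentation before relying on them.

```python
from urllib.parse import urlencode

# Assumption: base URL of the WikiPathways web service (not given in the article).
BASE_URL = "https://webservice.wikipathways.org"


def build_request_url(method: str, **params: str) -> str:
    """Build a GET URL for a web service method, asking for JSON output."""
    query = urlencode({**params, "format": "json"})
    return f"{BASE_URL}/{method}?{query}"


# Assumed method and parameter names; "WP554" is only an example identifier.
url = build_request_url("getPathwayInfo", pwId="WP554")
print(url)
```

Keeping URL construction separate from the network call makes it easy to inspect the request before sending it with any HTTP client.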

WikiPathways as linked data

Since 2016, the WikiPathways dataset has also been provided in the RDF (linked data) format. The data can be queried programmatically through the SPARQL endpoint, which also provides a user-friendly query editor.
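
Querying the endpoint from a script could look roughly like the following sketch, which uses only the Python standard library. The endpoint URL and the wp: and dc: terms in the query are assumptions that are not stated in this article. The request and result handling follow the standard SPARQL protocol and the SPARQL JSON results format.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumption: public SPARQL endpoint URL (not stated in the article).
ENDPOINT = "https://sparql.wikipathways.org/sparql"

# Assumption: the wp: and dc: terms follow the WikiPathways RDF vocabulary.
QUERY = """
PREFIX wp: <http://vocabularies.wikipathways.org/wp#>
PREFIX dc: <http://purl.org/dc/elements/1.1/>
SELECT ?pathway ?title WHERE {
  ?pathway a wp:Pathway ;
           dc:title ?title .
} LIMIT 5
"""


def build_request(query: str, endpoint: str = ENDPOINT) -> Request:
    """Prepare a GET request that asks for SPARQL JSON results."""
    url = f"{endpoint}?{urlencode({'query': query})}"
    return Request(url, headers={"Accept": "application/sparql-results+json"})


def parse_bindings(payload: dict) -> list[dict]:
    """Flatten SPARQL JSON results into one {variable: value} dict per row."""
    return [
        {name: cell["value"] for name, cell in row.items()}
        for row in payload["results"]["bindings"]
    ]


def run_query(query: str) -> list[dict]:
    """Send the query over the network and return the parsed rows."""
    with urlopen(build_request(query), timeout=30) as response:
        return parse_bindings(json.load(response))
```

Calling run_query(QUERY) would return a list of dictionaries keyed by the selected variables, provided the endpoint is reachable and the assumed vocabulary terms match the published data.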

The WikiPathways RDF dataset uses two core vocabularies and the standard set of identifiers for genes, proteins, and metabolites to support linking to other linked biological data sources. This lets users apply SPARQL federation to combine information from WikiPathways with other linked datasets in a single query.
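
Federation relies on the SERVICE keyword of SPARQL 1.1, which sends part of a query to a second endpoint and joins the results on shared variables. The sketch below assembles such a query as a string. The remote endpoint (example.org) and the ex:label predicate are hypothetical placeholders, and the wp:GeneProduct and wp:bdbUniprot terms are assumptions about the WikiPathways vocabulary rather than details taken from this article.

```python
def federated_query(
    local_pattern: str,
    remote_endpoint: str,
    remote_pattern: str,
    variables: list[str],
    prefixes: str = "",
) -> str:
    """Join a local graph pattern with a remote one through a SERVICE clause."""
    select = " ".join(f"?{name}" for name in variables)
    return (
        f"{prefixes}\n"
        f"SELECT {select} WHERE {{\n"
        f"  {local_pattern}\n"
        f"  SERVICE <{remote_endpoint}> {{\n"
        f"    {remote_pattern}\n"
        f"  }}\n"
        f"}}"
    )


# Hypothetical remote endpoint and ex: vocabulary; wp: terms are assumed.
PREFIXES = (
    "PREFIX wp: <http://vocabularies.wikipathways.org/wp#>\n"
    "PREFIX ex: <https://example.org/vocab#>"
)

query = federated_query(
    local_pattern="?gene a wp:GeneProduct ; wp:bdbUniprot ?protein .",
    remote_endpoint="https://example.org/sparql",
    remote_pattern="?protein ex:label ?label .",
    variables=["gene", "protein", "label"],
    prefixes=PREFIXES,
)
print(query)
```

The shared ?protein variable is what links the two sources, which is why common identifiers across datasets matter for federation.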

The bottom line

Launched in 2008 as an "experiment", the collaboratively edited WikiPathways encyclopedia today integrates data for over 2,800 biological pathways and makes it easily accessible both via wikipathways.org and programmatically as linked data.

The Wikipedia-like style of WikiPathways makes it stand out from other omics datasets. It actively promotes a collaborative approach to research and to the sharing of biological knowledge.

Made by Anton Vasetenkov.
