Protein Context Search
Search our database of 130,000+ microbial genomes for protein contexts containing a protein most similar (by embedding distance) to your query sequence. A protein context is the set of proteins encoded on the same contig (genomic neighborhood). Returns taxonomic information and functionally annotated proteins within each matching context.
Protein Context Search › Request Body
sequenceThe protein sequence to search for (max length 15000). Standard 20 and ambiguous B, X, Z, U residues are allowed.
diversityFilterControls result diversity by restricting the search to representative proteins at a given sequence identity threshold. low (default): searches the full database, returns most similar results. medium: clusters at 70% identity. high: clusters at 50% identity. max: clusters at 30% identity (most diverse results).
Protein Context Search › Responses
Successful Response
statusProtein Context Search (Multi-Query)
Search our database of 130,000+ microbial genomes for protein contexts where all query proteins co-occur in the same genomic neighborhood. Each query protein is matched by embedding distance, and only contexts containing matches to all queries are returned.
Protein Context Search (Multi-Query) › Request Body
sequencesList of 2–5 protein sequences to search for (each max 15000 characters). Standard 20 and ambiguous B, X, Z, U residues are allowed.
diversityFilterControls result diversity by restricting the search to representative proteins at a given sequence identity threshold. low (default): searches the full database, returns most similar results. medium: clusters at 70% identity. high: clusters at 50% identity. max: clusters at 30% identity (most diverse results).
Protein Context Search (Multi-Query) › Responses
Successful Response
status