ProteInfer: deep networks for protein functional inference

Summary

This work, completed as part of my residency at Google AI, uses deep residual networks to predict protein function from amino acid sequences. We show that these networks are able to perform this task effectively, in a way that complements BLAST-based approaches, and that they learn to place protein sequences into a generalised embedding space that facilitates downstream applications. Using TensorFlow JS, we built a tool that performs protein functional inference in the browser, client-side. The paper is presented in an interactive form that allows the reader to explore our work and try the models.

Publication
eLife
Theo Sanderson
Theo Sanderson
Sir Henry Wellcome Fellow

Biologist developing tools to scale pathogen genetics.