Researchers have developed a new self-supervised learning method that can improve the training of genomic models with less labeled data. The method, called Self-GenomeNet, leverages reverse-complement sequences and effectively learns short- and long-term dependencies by predicting targets of different lengths. Self-GenomeNet outperforms other self-supervised methods in data-scarce genomic tasks and outperforms standard supervised training with ~10 times fewer labeled training data. Furthermore, the learned representations generalize well to new datasets and tasks. These findings suggest that Self-GenomeNet is well suited for large-scale, unlabeled genomic datasets and could substantially improve the performance of genomic models.
Related Posts
A Collaborative Journey with the Korea Disease Control and Prevention Agency and the Korean Diabetes Association
The partnership between the Korea Disease Control and Prevention Agency (KDCA) and the Korean Diabetes Association (KDA) is an inspiring story of collaboration and collective action against the global challenge of diabetes. By leveraging their unique strengths, these two organizations have implemented groundbreaking initiatives in prevention, education, research, and patient support, ultimately improving the lives […]
Data harmony: New tool links single-cell discoveries
In the ever-changing world of scientific exploration, a groundbreaking tool has surfaced, promising to transform our understanding of how cells work. This innovative tool is like a super organizer for data.It brings together information from individual cells in a way that researchers can easily study and make sense of. As scientists continue to explore the […]
Cellular blueprints decoded: Math unveils the secrets of cell design
Scientists have created a massive collection of 200,000 cell images to explore the inner workings of our cellular world. This visual database acts like an extensive library of blueprints. Using advanced math, they’ve developed a complex framework, like a secret code, to decipher the basic rules governing cell construction. It’s like finding the hidden language […]