q-bio.GN Quantitative Biology

Genomics

Genomics, gene expression, sequencing

14.2 GN Mar 31
GenoBERT: A Language Model for Accurate Genotype Imputation

Lei Huang, Chuan Qiu, Kuan-Jui Su et al.

GenoBERT provides a scalable and robust solution for genotype imputation in genomic studies, addressing ancestry bias and limited rare-variant accuracy, though it is incremental in that it adapts existing transformer methods to this domain.

4.8 CL Apr 7
PhageBench: Can LLMs Understand Raw Bacteriophage Genomes?

Yusen Hou, Weicai Long, Haitao Hu et al.

PhageBench addresses the need for better tools in microbiology and biotechnology by assessing the potential of LLMs for genomic interpretation, though it is incremental in that it contributes a benchmark rather than a new model.

4.3 CL Jun 3
GENEB: Why Genomic Models Are Hard to Compare

Daria Ledneva, Mikhail Nuridinov, Denis Kuznetsov

GENEB provides a standardized evaluation framework for the genomic ML community to enable principled model comparison and selection.

9.8 GN May 11
GeneZip: Region-Aware Compression for Long Context DNA Modeling

Jianan Zhao, Xixian Liu, Zhihao Zhan et al.

For researchers in genomics and long-context DNA modeling, GeneZip provides an efficient compression method that reduces computational costs and enables larger models, but it is an incremental improvement over existing encoder-based compressors.

8.6 DS Mar 16
Hecate: A Modular Genomic Compressor

Kamila Szewczyk, Sven Rahmann

Hecate addresses the problem of efficient genomic data storage and access for bioinformatics researchers, offering incremental improvements in speed and compression.