ICLR 2025

Learning to Discover Regulatory Elements for Gene Expression Prediction

Xingyu Su, Haiyang Yu, Degui Zhi, Shuiwang Ji

Department of Computer Science & Engineering

Abstract

Three Key Components

$R_g$ - Genomic candidates: Regulatory elements that could interact with the target gene but may be inactive in certain cell types.

$R_m$ - Measured regions: Regions detected via epigenomic signals. They may correlate with expression, but their true target gene is often unknown.

$R_{ag}$ - Active and causal regulatory elements: Regulatory elements that directly influence target gene expression. These are the true causal drivers we aim to discover.

Causal Relationship

$X_{seq} \leftarrow R_g$: The DNA sequence contains all regulatory elements $R_g$ that interact with the target gene, as well as other non-causal regions.

$R_{ag} \rightarrow Y$: The causal active regulatory region directly affects gene expression.

$R_g \leftarrow R_{ag} \rightarrow R_m$: The key causal region is shared by both the DNA and the epigenomic signals.

$R_m \rightarrow X_{sig}$: The measured regulatory elements $R_m$ reflect active regions in the cell and often show up as peaks in experiments.
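The causal relationships listed above can be written down as a small directed graph. The following is a minimal sketch for illustration only, not the authors' implementation: the string node names (`"R_g"`, `"R_ag"`, `"R_m"`, `"X_seq"`, `"X_sig"`, `"Y"`) and the helper functions `parents`, `children`, and `descendants` are assumptions introduced here.

```python
# Directed edges (cause -> effect), transcribed from the four
# causal relationships stated on the poster. Node names are
# illustrative strings, not identifiers from any library.
EDGES = [
    ("R_g", "X_seq"),   # X_seq <- R_g
    ("R_ag", "Y"),      # R_ag -> Y
    ("R_ag", "R_g"),    # R_g <- R_ag
    ("R_ag", "R_m"),    # R_ag -> R_m
    ("R_m", "X_sig"),   # R_m -> X_sig
]


def children(node):
    """Return the direct effects of `node`, sorted by name."""
    return sorted(dst for src, dst in EDGES if src == node)


def parents(node):
    """Return the direct causes of `node`, sorted by name."""
    return sorted(src for src, dst in EDGES if dst == node)


def descendants(node):
    """Return every node reachable from `node` along directed edges."""
    seen = set()
    stack = [node]
    while stack:
        current = stack.pop()
        for child in children(current):
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return seen


if __name__ == "__main__":
    print("Direct causes of Y:", parents("Y"))
    print("Downstream of R_ag:", sorted(descendants("R_ag")))
    print("Downstream of R_g:", sorted(descendants("R_g")))
```

In this encoding, `R_ag` is the only direct cause of `Y`, and both observed inputs (`X_seq` and `X_sig`) lie downstream of `R_ag`. Neither `R_g` nor `R_m` has a directed path to `Y`, which matches the statement that these regions may correlate with expression without being its causal drivers.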