TY - JOUR
T1 - Genetic score omics regression and multitrait meta-analysis detect widespread cis-regulatory effects shaping bovine complex traits
AU - Xiang, Ruidong
AU - Fang, Lingzhao
AU - Liu, Shuli
AU - Liu, George E.
AU - Tenesa, Albert
AU - Gao, Yahui
AU - Mason, Brett A.
AU - Chamberlain, Amanda J.
AU - Goddard, Michael E.
AU - Canela-Xandri, Oriol
AU - Wang, Sheng
AU - Yu, Ying
AU - Cai, Wentao
AU - Li, Bingjie
AU - Pairo-Castineira, Erola
AU - D'Mellow, Kenton
AU - Rawlik, Konrad
AU - Xia, Charley
AU - Yao, Yuelin
AU - Navarro, Pau
AU - Rocha, Dominique
AU - Li, Xiujin
AU - Yan, Ze
AU - Li, Congjun
AU - Rosen, Benjamin D.
AU - Van Tassell, Curtis P.
AU - Vanraden, Paul M.
AU - Zhang, Shengli
AU - Ma, Li
AU - Cole, John B.
AU - CattleGTEx Consortium
N1 - Publisher Copyright:
© 2025 The Author(s). Published by Oxford University Press on behalf of National Academy of Sciences.
PY - 2025/7
Y1 - 2025/7
N2 - To complete the genome-to-phenome map, transcriptome-wide association studies (TWAS) are performed to correlate genetically predicted gene expression with observed phenotypic measurements. However, the relatively small training population assayed with gene expression could limit the accuracy of TWAS. We propose genetic score omics regression (GSOR) correlating observed gene expression with genetically predicted phenotype, i.e. estimated breeding values (EBVs) in agriculture or polygenic score (PGS) in medicine. The score, calculated using variants near genes with assayed expression (cis-EBV or cis-PGS), provides a powerful association test between cis-effects on gene expression and the trait. In simulated and real data, GSOR outperforms TWAS in detecting causal/informative genes. We applied GSOR to transcriptomes of 16 tissues (N ∼ 5,000) and 37 traits in ∼120,000 cattle and conducted multitrait meta-analyses of omics-associations (MTAO). We found that, on average, each significant gene expression and splicing mediates cis-genetic effects on 8-10 traits. Many prioritized genes by GSOR and MTAO can be verified by Mendelian randomization analysis and show significantly reduced dN/dS, suggesting elevated evolutionary constraint for these genes. Using multiple methods, we detect expression levels of genes and/or RNA splicing events underlying previously thought single-gene loci to influence multiple traits. For example, the expression and RNA splicing of DGAT1 from multiple tissues regulated milk production, mastitis, gestation length, temperament, and stature. Also, gene expression and splicing of ABO (Histo-blood group) and ACHE (acetylcholinesterase, Cartwright blood group) affected protein concentration and mastitis, respectively. Taken together, our work provides new methods and biological insights for prioritizing informative omics-phenotype associations in mammals.
AB - To complete the genome-to-phenome map, transcriptome-wide association studies (TWAS) are performed to correlate genetically predicted gene expression with observed phenotypic measurements. However, the relatively small training population assayed with gene expression could limit the accuracy of TWAS. We propose genetic score omics regression (GSOR) correlating observed gene expression with genetically predicted phenotype, i.e. estimated breeding values (EBVs) in agriculture or polygenic score (PGS) in medicine. The score, calculated using variants near genes with assayed expression (cis-EBV or cis-PGS), provides a powerful association test between cis-effects on gene expression and the trait. In simulated and real data, GSOR outperforms TWAS in detecting causal/informative genes. We applied GSOR to transcriptomes of 16 tissues (N ∼ 5,000) and 37 traits in ∼120,000 cattle and conducted multitrait meta-analyses of omics-associations (MTAO). We found that, on average, each significant gene expression and splicing mediates cis-genetic effects on 8-10 traits. Many prioritized genes by GSOR and MTAO can be verified by Mendelian randomization analysis and show significantly reduced dN/dS, suggesting elevated evolutionary constraint for these genes. Using multiple methods, we detect expression levels of genes and/or RNA splicing events underlying previously thought single-gene loci to influence multiple traits. For example, the expression and RNA splicing of DGAT1 from multiple tissues regulated milk production, mastitis, gestation length, temperament, and stature. Also, gene expression and splicing of ABO (Histo-blood group) and ACHE (acetylcholinesterase, Cartwright blood group) affected protein concentration and mastitis, respectively. Taken together, our work provides new methods and biological insights for prioritizing informative omics-phenotype associations in mammals.
UR - https://www.scopus.com/pages/publications/105011596333
U2 - 10.1093/pnasnexus/pgaf208
DO - 10.1093/pnasnexus/pgaf208
M3 - Article
C2 - 40688095
AN - SCOPUS:105011596333
SN - 2752-6542
VL - 4
JO - PNAS Nexus
JF - PNAS Nexus
IS - 7
M1 - pgaf208
ER -