Publication:
A comparison of topologically associating domain callers over mammals at high resolution

dc.contributor.authorSefer, Emre
dc.contributor.departmentComputer Science
dc.contributor.ozuauthorSEFER, Emre
dc.date.accessioned2023-06-08T13:03:02Z
dc.date.available2023-06-08T13:03:02Z
dc.date.issued2022-04-12
dc.description.abstractBackground: Topologically associating domains (TADs) are locally highly-interacting genome regions, which also play a critical role in regulating gene expression in the cell. TADs have been first identified while investigating the 3D genome structure over High-throughput Chromosome Conformation Capture (Hi-C) interaction dataset. Substantial degree of efforts have been devoted to develop techniques for inferring TADs from Hi-C interaction dataset. Many TAD-calling methods have been developed which differ in their criteria and assumptions in TAD inference. Correspondingly, TADs inferred via these callers vary in terms of both similarities and biological features they are enriched in. Result: We have carried out a systematic comparison of 27 TAD-calling methods over mammals. We use Micro-C, a recent high-resolution variant of Hi-C, to compare TADs at a very high resolution, and classify the methods into 3 categories: feature-based methods, Clustering methods, Graph-partitioning methods. We have evaluated TAD boundaries, gaps between adjacent TADs, and quality of TADs across various criteria. We also found particularly CTCF and Cohesin proteins to be effective in formation of TADs with corner dots. We have also assessed the callers performance on simulated datasets since a gold standard for TADs is missing. TAD sizes and numbers change remarkably between TAD callers and dataset resolutions, indicating that TADs are hierarchically-organized domains, instead of disjoint regions. A core subset of feature-based TAD callers regularly perform the best while inferring reproducible domains, which are also enriched for TAD related biological properties. Conclusion: We have analyzed the fundamental principles of TAD-calling methods, and identified the existing situation in TAD inference across high resolution Micro-C interaction datasets over mammals. We come up with a systematic, comprehensive, and concise framework to evaluate the TAD-calling methods performance across Micro-C datasets. Our research will be useful in selecting appropriate methods for TAD inference and evaluation based on available data, experimental design, and biological question of interest. We also introduce our analysis as a benchmarking tool with publicly available source code.en_US
dc.description.versionPublisher versionen_US
dc.identifier.doi10.1186/s12859-022-04674-2en_US
dc.identifier.issn1471-2105en_US
dc.identifier.issue1en_US
dc.identifier.scopus2-s2.0-85128158888
dc.identifier.urihttp://hdl.handle.net/10679/8366
dc.identifier.urihttps://doi.org/10.1186/s12859-022-04674-2
dc.identifier.volume23en_US
dc.identifier.wos000781938200002
dc.language.isoengen_US
dc.peerreviewedyesen_US
dc.publicationstatusPublisheden_US
dc.publisherBioMed Central Ltden_US
dc.relation.ispartofBMC Bioinformatics
dc.relation.publicationcategoryInternational Refereed Journal
dc.rightsopenAccess
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subject.keywordsChromatin organizationen_US
dc.subject.keywordsHi-Cen_US
dc.subject.keywordsMicro-Cen_US
dc.subject.keywordsTAD callersen_US
dc.subject.keywordsTopologically associating domainsen_US
dc.titleA comparison of topologically associating domain callers over mammals at high resolutionen_US
dc.typearticleen_US
dspace.entity.typePublication
relation.isOrgUnitOfPublication85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery85662e71-2a61-492a-b407-df4d38ab90d7

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
A comparison of topologically associating domain callers over mammals at high resolution.pdf
Size:
3.72 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.45 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections