20230209_pilot_DEG
20230209_pilot_DEG
20230209_pilot_DEG
2023.02.09
Pei-Yu Lin
Workflow
Core gene identification Essential gene prediction Local EGs detection
Analysing network
Start statistics between global
Network statistic analysis and local EGs
Collecting 281 plant genomes
Collecting 281 plant genomes
Stop
Essential gene collection
• Source: DEG database (http://origin.tubic.org/deg/public/index.php )
• Species:
Species Known EGs Abbreviation Download
Arabidopsis thaliana 356 DEG2003xxxx link
Mapping known EGs to subnets
• Tool: MMseqs2 (https://github.com/soedinglab/MMseqs2) 5 mins (88 cores 1.48 T)
• mmseqs easy-search all_plants.fa.gz DEG20.aa.gz deg.m8 tmp
• Tool: map_eg.R (https://github.com/beritlin/FunCore/blob/master/map_eg.R)
• Output: (examaple)
• Figures
With known EG
# subnets
Without known EG
# species