Prof. Arieh Warshel's Group
Professor Warshel's research covers a wide range of problems in modern biophysical chemistry. He and his coworkers have pioneered several of the most effective models for computer simulations of biological molecule. The studies of Warshel's group include the following:
Simulations of Enzyme Catalysis and Protein Action

Our early works paved the way for quantitative theoretical studies of enzymatic reactions. These works introduced the Hybrid - Quantum Mechanical / Molecular Mechanics (QM/MM) method and a microscopic approach for studies of electrostatic effects in proteins. The QM/MM and related approaches allow other scientists to study the energetics and dynamics of enzymatic reactions. Our group continues to push the frontiers of the field developing new approaches, studying complex effects such as quantum tunneling and entropic effects in enzymes and exploring the action of enzymes of special biological importance.

Simulating the Dynamics of Photobiological Processes

The first molecular dynamics simulation of a biological process was reported by Warshel in a 1976 study of the primary event of the vision process. Warshel's group continues to be very active in this field, studying ultrafast reactions such as the primary processes in the photosynthetic reaction center (where their theoretical study was the first to elucidate the correct electron transfer mechanisms) and the photoisomerization reaction in bacteriorhodopsin.

Simulation of Chemical Reactions in Solution

In order to understand enzymatic reactions it is crucial to have a quantitative picture of the corresponding reference reactions in solutions. The realization of this fact led Warshel's group to spend a major effort on studying the energetics and dynamics of chemical processes in solution. These studies include the use and development of various QM/MM approaches and related models for quantitative simulations of chemical reactions in solutions.

Electrostatic Energies in Macromolecules

Our early works involved the development of the first physically consistent models for studies of electrostatic energies in proteins. The use of these models led to the current realization that electrostatic energies provide the best way of correlating structure and function of biological molecules. The use of calculations of electrostatic energies in analyzing a wide range of biological problems is a major part of the research effort of Warshel's group. This includes evaluation of pKa's and redox potentials of proteins, drug designs, and studies of protein-protein interactions.

Protein Folding

The simplified model for protein folding introduced by Levitt and Warshel is now the method of choice in most studies of protein folding. Our recent effort in this direction has focused on developing innovative ways of using the results of the simplified model in the evaluation of the corrresponding free energies of more detailed all-atom models

Prof. Jenn-Kang Hwang's Group
Prof. Jenn-Kang Hwang is interested in the structure-dynamics- evolution relationship in proteins.
On the fundamental principles relating protein geometrical shapes to protein dynamics and function

Protein average thermal fluctuations are shown be derived directly from protein structures without assuming any mechanical models. A number of protein structural profiles such as residue packing, i.e. the weighted contact  number (WCN), protein centroid distance, thermal fluctuation profile or smoothed solvent accessible surface profiles are similar to a smoothed version of sequence conservation profile in varying degrees. This is surprising, since the WCN profile is derived from a single protein structure, while the sequence conservation profile is derived from multiple sequence alignment of a group of homologous sequences. This suggests that a single protein structure contains evolutionary information that was previously derived only from multiple sequences. We are also interested in the following topics such as knotted proteins, protein subcellular localization, protein structure prediction, disulfide proteins, molecular simulation and any topics that are intellectually interesting.

Selected Publications
  1. Liu JW, Cheng CW, Lin YF, Chen SY, Hwang JK, SC Yen, Relationships between residue Voronoi volume and sequence conservation in proteins BBA – Proteins and Proteomics (2017) (accepted)
  2. Huang TT, del Valle Marcos ML, Hwang JK, Echave J. A mechanistic stress model of protein evolution accounts for site-specific evolutionary rates and their relationship with packing density and flexibility. J. BMC Evol Biol. 2014 Apr 9;14:78. doi: 10.1186/1471-2148-14-78.
  3. Yeh SW, Liu JW, Yu SH, Shih CH, Hwang JK, Echave J. Site-specific structural constraints on protein sequence evolutionary divergence: local packing density versus solvent exposure. Mol Biol Evol. 2014 Jan;31(1):135-9. doi: 10.1093/molbev/mst178.
  4. Yeh SW, Huang TT, Liu JW, Yu SH, Shih CH, Hwang JK, Echave J. Local packing density is the main structural determinant of the rate if protein sequence sequence at the site level. Biomed Res Int. 2014;2014:572409. doi: 10.1155/2014/572409.
  5. Shih CH, Chang CM, Lin YS, Lo WC, Hwang JK. Evolutionary information hidden in a single protein structure. Proteins (2012) 80, 16472.
  6. Lin CP, Huang SW, Lai YL, Yen SC, Shih CH, Lu CH, Huang CC, Hwang JK. Deriving protein dynamical properties from weighted protein contact number, Proteins: Structure, Function and Bioinformatics (2008) 72, 929-935.
  7. Shih CH, Huang SW, Yen SC, Lai YL, Yu SH, Hwang JK. Computation of protein dynamics without a mechanical model, Proteins: Structure, Function and Bioinformatics (2007) 68, 34-38
Prof. Hsien-Da Huang's Group
Laboratory of Computational Health Informatics and Omics (Computational Omics Lab) aims to integrate interdisciplinary approaches to providing insights into important biological problems and the industrial value of clinical applications. We also enable the development of new biological resources with broad applications, especially in microRNAs, protein modifications and gene regulation.
Computational Health Informatics and Omics

Recently, high-throughput sequencing and Brain-Computer interface technology become our major focus both in scientific research and in clinical applications. The ultimate goal in our lab is to get a comprehensive view of complex disease-gene-drug relationships - from the molecular level to biological functions of drug targets and response. Reveal previously unseen patterns across heterogeneous and big datasets to predict targets and biomarkers. Our team will also define a rigorous data-mining program and integrate the data using machine learning or other deep analytical approaches.

A resource for experimentally validated microRNA-target interactions
Dr. Huang’s group developed a miRNA database, miRTarBase, which is an internationally recognized repository for collecting experimental miRNA-Target interactions (MTIs). miRTarBase 2018 update contains 422,517 curated miRNA-target interactions between 4,076 miRNAs and 23,054 target genes, collected from over 8,510 articles. Generally, the collected MTIs have been validated experimentally by 3’UTR-reporter assay, western blotting, microarray analysis and by next-generation sequencing experiments. The users are majorly from the China, United States and India. The database has earned more than 1500 citations. The database is continuously maintained and now broadly utilized and referred in life science and medical research field.
Research / Development Strategy and Direction
  1. Keep outstanding bioinformatics in microRNA and small RNA research areas.
  2. Application of High-throughput technology and cross-domain research in biomedical research.
  3. High throughput multilevel body technology (Multi-Omics) and big data analysis.
  4. Research on Genome Medicine, microbiology, epigenetics, and noncoding RNA.
  5. Development of Next-Generation Sequencing (NGS) and Third-Generation sequencing (TGS) technology and integration analysis methods.
  6. Development of new cancer diagnosis related to miRNA Biomarker and target.
  7. Development of biological databases and bioinformatics tools.
  8. Development of non-invasive detection and precision medical technology.
  9. Development of new database and computing system for transitional Chinese herbal medicine.
  10. Clinical Applications of Brain-Computer Interfaces
Key Words:
DNA level, RNA level, miRNA and small RNA, Next-generation sequencing, Third-generation sequencing, Single cell analysis and sequencing, Omics, Artificial Intelligence, Algorithm, Parallel Computation, Machine Learning, Drug design, Drug development, Traditional Chinese medicine (TCM), Brain-Computer Interface (BCI)
Prof. Tzong-Yi Lee's Group
Dr. Lee’s group research interests include bioinformatics, genomics and proteomics, network biology, data mining in Omics science, shallow and deep learning, database design and software development. His research team currently specialize in gene promoters and transcriptional regulations, circular RNA regulation, microbiota and functional analyses, protein post-translational modifications (PTMs), protein kinases and signaling pathway, protein ubiquitylation and E3 ligases regulations, protein glycosylation and glycosyl-structure identification, antimicrobial peptides (AMPs), multidrug-resistant pathogens diagnosis, and drug repurposing. His longer-term research goals are focusing on the integration of Bioinformatics and Omics science to address emerging biological and clinical questions for precision medicine.
Computational identification of protein PTM sites
Protein PTMs are one of the most important mechanisms of eukaryotic and prokaryotic cells, which involve the attachment of chemical groups to amino acid side chains of proteins. A variety of PTMs have been reported to play crucial roles in diversely cellular processes that modulate a protein’s function, physicochemical properties, conformation, stability, and molecular interactions in response to developmental signals or environmental stimuli. Owing to the labile nature and low abundance of in vivo PTM sites, further research is needed to clarify the characteristics of PTMs and its mechanisms. Therefore, there is an urgent requirement in bioinformatics for a practical approach to investigate the potential substrate motifs of protein PTM sites to be designed. Our research group have developed over 20 tools for prediction of PTM sites based on sequence and structral information.
dbPTM in 2019: exploring disease association and cross-talk of post-translational modifications
The dbPTM has been maintained for over ten years with the aim to provide functional and structural analyses for post-translational modifications (PTMs). In this update, dbPTM not only integrates more experimentally validated PTMs from available databases and through manual curation of literature but also provides PTM-disease associations based on non-synonymous single nucleotide polymorphisms (nsSNPs). The high-throughput deep sequencing technology has led to a surge in the data generated through analysis of association between SNPs and diseases, both in terms of growth amount and scope. This update thus integrated disease-associated nsSNPs from dbSNP based on Genome-Wide Association Studies (GWAS). (
dbAMP: an integrated resource for exploring antimicrobial peptides with functional activities and physicochemical properties on transcriptome and proteome data
An increasing interest therefore was motivated to explore natural AMPs that enable the development of new antibiotics. With the potential of AMPs being as new drugs for multidrug-resistant pathogens, we were thus motivated to develop a database by accumulating comprehensive AMPs from public domain and manually curating literature. Currently in dbAMP there are 12,389 unique entries, including 4271 experimentally verified AMPs and 8118 putative AMPs along with their functional activities, supported by 1924 research articles. The advent of high-throughput biotechnologies, such as mass spectrometry and next-generation sequencing, has led us to further expand dbAMP as a database-assisted platform for providing comprehensively functional and physicochemical analyses for AMPs based on the large-scale transcriptome and proteome data. Significant improvements available in dbAMP include the information of AMP-protein interactions, antimicrobial potency analysis for “cryptic” region detection, annotations of AMP target species, as well as AMP detection on transcriptome and proteome datasets.
Prof. HIRAO, Hajime's Group

Prof. HIRAO's research group is interested in computationally deriving general principles of chemical reactions and molecular interactions occurring in nature and labs, and also in rationally designing therapeutic and other types of functional molecules with experimentalists.

Specific research themes of the group include the following:

1. Development of intuitively appealing methods for analyzing chemical reactivities and molecular interactions: reactive hybrid orbital (RHO); reactive bond orbital (RBO); energy decomposition analysis; reconstruction of molecular wave functions for chemistry purposes.

2. Development of unique molecular mechanics (MM) force-field parametrization schemes: partial hessian fitting (PHF); full hessian fitting (FHF); internal hessian fitting (IHF); geometry amendment (Katachi); genetic and other optimization algorithms; machine learning; Python, etc.

3. Enzyme catalysis: metalloenzymes, proteases, etc.; density functional theory (DFT); ab initio quantum chemistry; quantum mechanics and molecular mechanics (QM/MM); ONIOM; molecular dynamics (MD); catalytic cycles; QM/MM energy decomposition; free energy calculations.

4. Computational drug design: discovery of potent and selective enzyme inhibitors; docking simulation; analysis of protein-ligand interactions; drug metabolism; mechanism-based inactivation (MBI); central nervous system (CNS); β-lactam antibiotics; DNA-targeting anti-cancer drugs.

5. Bioinorganic chemistry: heme enzymes (cytochrome P450, etc.); nonheme enzymes (myo-inositol oxygenase (MIOX), hydroxyethylphosphonate dioxygenase (HEPD), etc.); high-spin reactivity of iron(IV)-oxo intermediates; metal-oxo complexes bearing synthetic ligands; C–H bond activation.

6. Homogeneous and heterogeneous catalysis: transition-metal catalysis; photocatalytic reactions; main group chemistry; organocatalysis; reactions catalyzed by heterogeneous systems such as nanoparticles and metal–organic frameworks (MOFs); machine learning; QM/QM hybridization.

7. Porous materials: MOFs; pillararenes; zeolites; molecular organic cages; porous organic frameworks (POFs); adsorption, separation, purification, catalytic activation, and sensing of guest molecules; QM/MM, MM, and plane-wave density functional theory (DFT) treatments of porous systems.

Prof. Gui-Juan Cheng's Group
Research in Dr. Cheng's group involves the use of combined theoretical and experimental methods to investigate enzymatic and organic reactions.
Computational and experimental study of biological & chemical catalysis

Enzymes play important roles in many key biological processes and therefore are major targets for biomedical research. Better understanding of enzymes is essential for analyzing the enzyme activities, and for the design of inhibitors as pharmaceutical lead compounds. Our particular interest is the epigenetic enzymes, metalloenzymes, biosynthesis, and inhibitor design. The main computational methods used are combined quantum mechanical/molecular mechanical modelling (QM/MM), molecular dynamics simulation and molecular docking.

Organic synthesis greatly facilitates the development of modern medicine, but there are many challenges to overcome, such as creating new reaction types, increasing reaction efficiency, and decreasing the cost and waste. Mechanism understanding of organic reactions provides vital information to optimize reactions and design new reactions. We use quantum mechanics (QM) method and experimental methods (MS, EPR, electrochemical methods, etc.) to understand, predict, optimize, and design organic reactions. The current primary area of interest includes organic synthesis of drug molecules, bio-active compounds and natural products, transition-metal catalyzed reactions, radical reactions and electrochemical reactions.

Other research interests include computer-aided drug design and fundamental principles of enzymatic and chemical catalysis

Prof. Lizhe Zhu's Group

Dr. Zhu’s current research focuses on RNA-protein interactions, mechanism of RNA/DNA interference and the development of automated path searching methods and other enhanced sampling techniques.

Development of enhanced sampling techniques
such as automated path searching methods and investigation of functional dynamics for biological systems

The group's research currently focus on the development of enhanced sampling techniques such as automated path searching methods and the biological applications.

  1. To develop the method to locate multiple transition pathway (High-energy parallel cascade selection, HePaCS) and to optimize the multiple transition paths (multiple Traveling-salesman based Automated Path Searching, mTAPS);
  2. To investigate the molecular mechanism of RNA/DNA interference with argonaute.
  3. To investigate the molecular mechanism of small-molecule induced homo/hetero-dimerization effects of MEK1 kinase.
Prof. Bai's Group
Multi-Scale Modeling and Simulation of Large Biophysical Systems
The high-resolution cryo-EM technology provides detailed structural information of biophysical systems. However, the mechanism of how molecular machines work is still illusive. A main goal of our group is to construct the free energy landscape of macromolecular machines during its biological process and to obtain key dynamical information. With the application of multi-scale simulation tools that was developed by Prof. Arieh Warshel (Nobel Prize in Chemistry, 2013), our calculations reveal the functionalities of systems of interest, which could serve for the AI aided drug design in the next stage.
AI-Aided Drug Design (AADD)

Artificial intelligence (AI) has profoundly changed the mode of drug research and development (R&D) industry. In contrast to traditional methods, AADD has the ability to explore much larger chemical space by computer, thus improving the structural novelty of drug molecules and reduce the time of the cycle. Our group aims to develop an interdisciplinary AADD platform for novel drug discovery and modification. The platform combines the world-leading computational biology technologies, ultra-large scale high-throughput virtual screening technologies, and the AI algorithms based on original molecular features from mechanism studies.