UROP Proceedings 2020-21

School of Science
Department of Chemistry

Deep Learning in Synthesis Planning
Supervisor: SU Haibin / CHEM
Student: CHUI Sin Yu / CHEM
         LEE Subin / CHEM
         WANG Yizhou / CHEM
Course: UROP2100, Summer
        UROP1100, Summer
        UROP1100, Summer

Reaction conditions are crucial to synthesis planning, which is applied widely across many fields of chemistry. Reaction data analysis may help chemists judge whether a pathway is likely to be more efficient without hands-on laboratory work. The choice of catalyst is another important condition in catalytic reactions for organic synthesis. This report gives an overview, based on reaction data analysis, of reactions that use copper catalysts with bidentate nitrogen ligands. It covers general classes such as C-C bond formation, reaction types, and the usage of bidentate nitrogen ligands.

Employing Coevolution Analysis for Investigating and Improving Cas9 Specificity Towards More Safe CRISPR-Based Genome Editing
Supervisor: SU Haibin / CHEM
Student: WOO Dorothy Hoi Shan / BTGBM
Course: UROP1100, Fall
        UROP2100, Spring

CRISPR is a natural adaptive immune system found in many naturally occurring organisms. While many proteins can act as the effector module in CRISPR, CRISPR-associated protein 9, more commonly known as Cas9, has proven highly efficient for targeted genome editing. The CRISPR/Cas9 system has been applied in many areas of biological research, such as gene therapy, bioimaging and drug discovery. However, the off-target behaviour and specificity of these proteins are not yet fully understood, and a more extensive understanding of the specificity of Cas9 mechanisms is needed to improve genetic engineering and editing techniques. This progress report uses coevolution analysis to study the relative interactions within the Cas9_BH and HNH_4 domains of the protein.
Through coevolution analysis, we identified highly correlated amino acid residue couplings within each of the two domains. These data can be used in further research on the specificity of Cas9 activity in these domains.
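
The abstract does not state which coevolution measure was used. Purely as an illustration, one common way to score residue coupling is the mutual information between two columns of a multiple sequence alignment. The sketch below assumes that measure. The function names and the toy alignment are hypothetical and are not taken from the project.

```python
from collections import Counter
from itertools import combinations
from math import log2


def column_mutual_information(msa, i, j):
    """Mutual information (in bits) between alignment columns i and j.

    `msa` is a list of equal-length aligned sequences (assumed input format).
    """
    n = len(msa)
    count_i = Counter(seq[i] for seq in msa)
    count_j = Counter(seq[j] for seq in msa)
    count_ij = Counter((seq[i], seq[j]) for seq in msa)
    mi = 0.0
    for (a, b), c in count_ij.items():
        # p(a, b) * log2( p(a, b) / (p(a) * p(b)) ), written with raw counts
        mi += (c / n) * log2(c * n / (count_i[a] * count_j[b]))
    return mi


def top_couplings(msa, k=5):
    """Return the k column pairs with the highest mutual information."""
    length = len(msa[0])
    scores = [
        ((i, j), column_mutual_information(msa, i, j))
        for i, j in combinations(range(length), 2)
    ]
    scores.sort(key=lambda item: item[1], reverse=True)
    return scores[:k]


# Toy alignment (hypothetical): columns 0 and 1 always vary together,
# column 2 is fully conserved, column 3 varies independently of column 0.
toy_msa = ["AKLE", "AKLD", "GRLE", "GRLD"]
best_pair, best_score = top_couplings(toy_msa, k=1)[0]
print(best_pair, best_score)
```

In this toy alignment the pair of columns 0 and 1 ranks first because the two positions change together in every sequence. On real alignments, raw mutual information is usually corrected for phylogenetic and sampling bias before couplings are interpreted.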