Crash Course: Python Scripting for Biochemistry & Molecular Biology
Thursday, April 20th, 2023 • 12:00 PM EDT
Access course video recordings and materials at PDB-101
Learning Objectives
In the first workshop, we explored some Python syntax, loops, logic, file parsing, working with multiple files, and molecular visualization using Python in Jupyter notebooks. If you have not used Python or done coding before, we encourage you to watch the first workshop and practice coding there.
We are offering this second workshop virtually and we will continue to use a live-coding approach. It will not be just another lecture. You will be coding right along with us as you learn additional skills, using some new libraries that we will introduce.
Each live coding segment will be followed by a hands-on programming exercise to introduce you to the programming language and to help you identify the challenges associated with coding. Following a brief review of Python and the Jupyter Notebook environment, you will learn to build and extract data from data frames using the Pandas library. Then you will learn curve fitting with enzyme kinetics data. We will explore two different methods to extract data from mmCIF files (the default storage format for the RCSB PDB) using the Biopython library and the RCSB PDB Application Program Interface (API). You will be exposed to lot of new material!
If time allows, we are also going to learn about molecular visualization in Jupyter Notebooks using the iCN3D tool. One of our major goals is to impart sufficient confidence that you will move beyond this workshop to explore other Python libraries and approaches for data analysis and presentation that apply directly to your professional development.
Who Should Attend
- Undergraduate students in any STEM discipline
- Graduate students and postdoctoral researchers in life sciences and data collection
- Professionals involved in collecting and analyzing data, particularly big data
Program | Download Printable Program
12:00–12:05 PM |
Introduction |
12:05–12:30 PM |
Python Primer |
12:30–1:00 PM |
Working with Pandas |
1:00–1:30 PM |
Questions and Hands-On Exercise |
1:30–1:45 PM |
Break |
1:45–2:15 PM |
Nonlinear regression |
2:15–2:45 PM |
Questions and Hands-On Exercise |
2:45–3:00 PM |
Break |
3:00–3:30 PM |
Biopython and mmCIF files |
3:30–4:00 PM |
Questions and Hands-On Exercise |
4:00–4:15 PM |
Break |
4:15–4:45 PM |
PDB API and mmCIF files |
4:45–5:00 PM |
Break |
5:00–5:55 PM |
Molecular Visualization with iCN3D in Jupyter Notebooks |
5:55-6:00 PM |
Closing |