Victoria Cheung, PhD
Computational Biologist | Health Data Scientist
Education
(UCSF) University of California, San Francisco, Ph.D. Genetics; concentration in Neuroscience
(UCSD) University of California, San Diego, B.S. Microbiology; minor: Chinese Studies
Additional Courses
Genentech L.E.A.D Discovery Program, Certification. Supply Chain Management
Cold Spring Harbor Laboratory, Vision: Linking Circuits, Perception, and Behavior
Experience
Career-related
APR 2022 - Present
Freenome
Computational Biologist
- Apply bioinformatics, data science, and computational methods (including AI/ML techniques) to analyze multi-omic data to reveal, model, and interpret changes in both the cancer (pathways, gene activities, proteins) and the immune system (composition, activity, and repertoires) associated with clinical outcomes.
- Generate new insights and interpretations.
- Leverage existing computational methods and develop new ones to extract immunological signals from existing and new data.
- Partner cross-functionally in the scientific planning and execution of collaborative projects, such as molecular and cancer biologists, immunologists, computational biologists, medical affairs, commercial, business development.
- Execute research projects to model various biological changes resulting from diseases such as cancer, autoimmune disease, and infection with various business partners. (Publicly disclosed partners include: SIEMENS Healthineers, ADCT).
- Developed 2 software packages for reproducible data analysis for the team.
- (1) Iterated on Freenome’s internal fragmentomics modeling architecture to predict gene activation scores from cfDNA.
- (2) Wrote distributed workflows (Flyte) to increase efficiency of scRNAseq alignment and data aggregation from a scale of running for 8 days to half a day.
- Computational Lead on an internal project that validates and characterizes features of Freenome’s fragmentomics model, 1st author manuscript in progress.
- Worked collaboratively on modeling multi-omics plasma data (DNA methylation, proteomics, fragmentomics) to build classifiers for early stage breast cancer detection in partnership with SIEMENS Healthineers, and in determining predictive biomarkers for overall survival in DLBCL in partnership with ADCT (ASH 2022 abstract 2nd author, AACR 2023 abstract co-first author)
- Worked with Freenome’s machine learning models to detect colorectal cancer disease burden from deep methylation sequencing of plasma, performing longitudinal monitoring on patients validated with imaging data (AACR 2024 abstract, co-first author)
- LOD quantification of Freenome’s computational fragmentomics approach. Successfully determined the minimum quantity of input mass required to obtain reliable and accurate readouts to establish analytical sensitivity. (AACR 2024 abstract, 2nd author)
- Mentored other scientists on the team through technical support, infrastructure support, as well as leading journal club discussions.
Technologies used:
- Python (ScanPy, NumPy, Pandas, scikit-learn, Matplotlib, SciPy, PyTorch, Freenome proprietary software)
- R (fgsea, Seurat)
- flyte
- AppScript
SEP 2021 - APR 2022
Genentech
Oncology Bioinformatics & Molecular Oncology PhD Intern
- Characterized gene signature development and refinement for T cell signaling pathways in cancer models
- Wrote a data processing pipeline utilizing Scanpy, Numpy, Pandas, scikit-learn.
- Performed statistical analyses on different drug treatment populations (gene set enrichment analysis, differential gene expression analysis).
- Utilized supervised batch correction techniques and unsupervised clustering algorithms (UMAP, topic modeling) to visualize and analyze single cell RNA seq data outputs.
- Wrote custom plotting functions using Matplotlib to better visualize the effect of drug treatments.
Technologies used:
- Python (ScanPy, NumPy, pandas, scikit-learn, Matplotlib, SciPy)
- R (fgsea, SingleCellExperiment, Seurat)
JUL 2016 - SEP 2021
Evan Feinberg Lab @ UCSF
Graduate Resarcher in Single-cell Omics, Systems Neuroscience
Project 1
Developed a multiplexed, high-throughput, single-cell sequencing method for neurons that preserve connectivity information in addition to obtaining molecular identity (VECTORseq). Git Repo here.
- Wrote the data processing pipeline using Python after genome alignment using Cellranger (10x Genomics) on an AWS EC2 instance.
- Used unsupervised machine learning techniques such as t-SNE/UMAP clustering to match molecular identities to cellular function and role in behavioral output.
- Implemented nearest neighbors algorithms to account for batch differences when merging datasets.
- Streamlined brain dissociation techniques and increased neuron survivability yield 100-fold based on data-driven outcomes from clustering analyses.
- Validated clustering results of single-cell sequencing against the 2020 10x sequencing dataset from the Allen Atlas and that the methodology was functional.
- Evaluated range of highly variable genes expressed per cluster for the validation of cell identity.
- Managed collaborations with the Chan-Zuckerberg Biohub (Spyros Darmanis Group, now @ Genentech).
Technologies used:
- AWS (EC2/S3)
- Linux, bash, CellRanger
- Python (ScanPy, NumPy, pandas, scikit-learn, Matplotlib, SciPy)
- FIJI, Zen
- Illumina Next Gen Sequencing, 10x Genomics 5’ Sequencing
- stereotaxic surgeries, viral delivery
Project 2
Designed an audition-based behavioral paradigm to study sensorimotor integration in the context of mice.
- Wrote custom software to support custom-built hardware using serial communication between MATLAB and an Arduino microprocessor, which increased productivity by - 6-fold from the parallelization and automation of data acquisition, storage, and analysis.
- Used this system in exploring how sensory input is represented in the brain and transformed into behavioral commands, using mice as the model organism.
- Wrote custom analyses software to automate, refine, and interpret both raw behavioral data and fiber photometry signals.
- Used CAD software to design and 3D print custom behavioral apparatuses.
- Refined surgical protocols to increase survival surgery success by 20%.
- Delivery of viruses, drugs, and organic dyes into the mouse brain.
- Performed physiology recordings on brain slices to validate optogenetic and fiber photometry experiments.
- Assembled fiber photometry and optogenetic manipulation equipment to record and perturb neuronal activity in the context of quantitative behavioral assays.
Technologies used:
- CAD Software (Onshape, Cura, eMachineShop)
- MATLAB
- Arduino (Uno)
- FIJI, Zen
- stereotaxic surgeries, viral delivery, fiber optic implants
- fiber photometry, optogenetics
- immunohistochemistry
MAY 2020 - JUL 2020
Insight Data Science @Silicon Valley
Health Data Science Fellow
Developed a predictive clinical calculator to assess Acute Kidney Injury in hospitalized patients, which would result in better management, care/medication dosing, injury prevention, and reduced hospital length of stay, thus freeing up occupied resources and minimizing financial costs to both patient and hospital.
- Utilized PostgreSQL querying to gather relevant data from the MIMIC-III database and manipulated the data with Python Pandas from 25 tables of data, 46,000 patients, thousands of diagnoses and lab tests, and clinical documentation– generating over 3 million rows of data and 70 unique features comprising lab tests and demographic information.
- Used supervised machine learning in Python such as regression models from scikit-learn and XGBoost to forecast Acute Kidney Injury, with a predictive accuracy of ~91%.
- Medium Article in Towards Data Science: Predicting Acute Kidney Injury in Hospitalized Patients Using Machine Learning.
Technologies used:
- Python (NumPy, pandas, scikit-learn, Matplotlib, SciPy, XGBoost)
- SQL
- AWS (EC2, S3, Route 53)
- Streamlit
Other
2013-2018
Genentech
[Genentech] Discovery Program L.E.A.D Supply Chain
Certification
- Learned about the fundamentals of supply chain, how the supply chain spans a variety of roles throughout Genentech’s delivery of therapies as well as its involvement in providing medication access to underserved communities and its drive towards sustainability.
- Chatted with supply chain business leaders to interact with individuals in the industry.
- Discussed the transferability of skills from the PhD to business/supply chain.
- Chatted with supply chain business leaders to interact with individuals in the industry.
- Participated as Operations Lead in a supply chain simulation where my team and I placed second overall.
APR 2016 - JUN 2016
Guo Huang Lab @ UCSF
Graduate Researcher–rotation
Area of Research: Regenerative Potential of Cardiomyocytes
- Performed heart explants for cell culture and subsequent drug studies for the purpose of exploring organ regeneration and repair in neonatal mice—with an emphasis on the pathways that regulate resident stem cell activation and mature cell de-differentiation/proliferation.
- Explored organ regeneration and development from an evolutionary standpoint across different species of animals i.e. naked mole rat, finch, rat, mouse, zebrafish.
- Utilized innovative and integrated approaches in engineering, single cell analysis, advanced imaging microscopy, drug delivery, and genome manipulation technology.
SEP 2015 - DEC 2015
Dengke Ma Lab @ UCSF
Graduate Researcher–rotation
Area of Research: Homeostatic Response to Extreme Abiotic Factors
- Created a functional mutant line in C. elegans via cDNA microinjections and exposed the mutants to extreme abiotic environments via behavioral assays to understand cellular intrinsic tolerance of hypoxia/anoxia and hypothermia.
- Utilized RNA-seq to identify genes implicated in cryopreservation/hypoxia-tolerance with therapeutic potential.
- Obtained qualitative behavioral data on how animals sensed and responded to changes in internal states to elicit behavior and maintain homeostasis.
SEP 2012 - JUL 2015
Andrew D. Huberman Lab @ UCSD
Reasearch Fellowship: UCLEADS & STARS
Area of Research: Binocular Plasticity & Dynamic Strategy Implementation in Cuttlefish
- Developed a model of visual perception and prey capture using cuttlefish to study the neural circuit organization supporting flexible eye movements. Underlying goal: provide insight into amblyopia (lazy eye).
- Optimized behavioral parameters and refined surgical techniques for the novel model organism.
- Increased consistency between experiments for reproducibility.
- Streamlined tracking and analysis of dynamical eye movements using multi-planar high-speed imaging and Simi Motion software and increased productivity and output by 40%.
- 3D-reconstructed neuron structure for morphometric analysis using Neurolucida.
JUN 2014 - SEP 2014
David R. Copenhagen Lab @ UCSF
Research Fellowship: UCSF SRTP
Area of Research: Effects of Light Dependent Ca2+ Signaling during retinal development
- Explored the effects of light dependent Ca2+ waves in the developing mouse retina with a focus on the coupling of amacrine cells to melanopsin ganglion cells via gap junctions.
- Characterized appropriate transgenic lines and established baseline comparisons in adult retina to observe and document deviations from the developed animal to the developing animal.
- Performed retinal dissections for cell coupling studies.
Mentorship & Diversity
JUN 2026 - SEP 2021
Mentor for Undergraduates
- Trained and mentored 3 undergraduates on performing research tasks on how to: think independently, plan experiments, perform surgical protocols, and analyze data. Gave career/research advice.
- Post-graduation outcomes of the 3 undergraduates:
JUN 2019 - AUG 2019
UCSF SRTP
Student Advisor,
Developed curriculum for and taught curriculum to teach rising junior and senior undergraduates on:
- how to become a strong graduate school applicant
- how to create compelling posters and presentations
- how to write personal statements
- how to read and dissect scientific papers.
JAN 2016 - JUN 2016
UCSF Science and Health Education Partnership
Student Teacher,
- Created and developed a series of interactive and investigative lesson plans to teach freshman biology.
- Mentored URMs and socioeconomically disadvantaged students on different career paths in science.
MAR 2013 - JUN 2015
UCSD
UC Leadership Through Advanced Degrees Scholar (UCLEADS),
“Mentorship program for underprivileged and socioeconomically disadvantaged undergraduates for success in graduate school to later assume positions of leadership in industry, government, public service, and academia following completion of a doctoral STEM degree”
- Two-way avenue:
- Received mentorship from prior two cohorts as part of the incoming cohort
- Provided mentorship to the next two cohorts while progressing through the program
- Received mentorship from prior two cohorts as part of the incoming cohort
Publications
* denotes equal contribution
Vallania, F. *, Cheung, V. *, Tripathi, A., Louie, M., Snyder, T., Lin, J., Havenith, K., Qin, Y., Pantano, S., Wuerthner, J., van Berkel P.H.; (2023) Discovery of plasma protein biomarkers associated with overall survival in R/R DLBCL patients treated with loncastuximab tesirine. Cancer Res 1 April 2023; 83 (7_Supplement): 5387. https://doi.org/10.1158/1538-7445.AM2023-5387
Vallania, F., Cheung, V., Zamba, MD., Liu, J., Pasupathy, A., Donnella, H., Bailey, M., Louie, M., Lin, J., Havenith, K., Qin, Y., Pantano, S., Wuerthner, J., van Berkel, PH.; Identification of Predictive Biomarkers for Response of R/R DLBCL Patients Treated with Loncastuximab Tesirine Using Low Pass Whole-Genome Sequencing (WGS). Blood 2022; 140 (Supplement 1): 3551–3552. doi: https://doi.org/10.1182/blood-2022-168993
Cheung, V., Chung, P., and Feinberg, E.H. (2022) Transcriptional profiling of mouse projection neurons with VECTORseq STAR Protocols, 3(3):101625
Cheung, V., Chung, P., Bjorni, M., Shvareva, V.A., Lopez, Y.C., and Feinberg, E.H. (2021) Virally Encoded Connectivity Transgenic Overlay RNA sequencing (VECTORseq) defines projection neurons involved in sensorimotor integration. Cell Reports, 37(12):110131
Cheung, V. “Predicting Acute Kidney Injury in Hospitalized Patients Using Machine Learning” Towards Data Science. Medium, 20 Jun. 2020. Web.
Conferences & Talks
Scientific
2023
American Association for Cancer Research (AACR) | Presenter
2022
American Society of Hematology (ASH) | Presenter
2021
UCSF S.O.L.V.E. Health Tech: Digital Health Equity Summit | Attendee
2021
COSYNE (computational and systems neuroscience) | Attendee
2018
UCSF Tetrad | Presenter, 15 min talk
2017
SFN Annual Conference | Attendee
2017
UCSF Tetrad | Presenter, poster presentation
2016
UC LEADs Research Symposium | Attendee
2015
UC LEADs Research Symposium | Presenter, poster presentation
2014
SFN Annual Conference | Presenter, poster presentation
2014
UCSF Summer Research Training Program Symposium | Presenter, poster presentation + 15 min talk
2014
UCSD Academic Enrichment Program | Presenter, poster presentation
2014
SACNAS National Conference | Presenter, poster presentation
2014
UCLEADs Annual Research Symposium | Presenter, poster presentation
2013
UCSD Academic Enrichment Program | Presenter, poster presentation
2013
SACNAS National Conference | Presenter, poster presentation
2013
UCSD STARS Summer Research Conference | Presenter, 15 min talk
Diversity & Outreach
2022
SRTP Co-Curriculum: What Can You Do With a PhD? | Panelist (1hr)
2019
Northern California Forum for Diversity in Graduate Education | Panelist (1hr)
2018
Northern California Forum for Diversity in Graduate Education | Panelist (1hr)
Honors & Awards
2022
UCSF Diversity Graduation, Graduate Division Speaker
2017
Helmsley Scholar
2015
UC LEADs Symposium Presentation Award
2014
SACNAS National Research Conference Travel Scholarship
2013
UCSD STARS Scholarly Presentation Award
2013
SACNAS National Research Conference Travel Scholarship
2013
UCSD Provost Honors
2012
UCSD Provost Honors
2012
Kaiser Permanente Valuable Volunteer Award
2011
UCSD Provost Honors
2011
Kaiser Permanente Student Achievement Award
Media Features
2019
UCSF Poster, First Generation to College
2019
UCSF Article, Students Who Are First in Their Family to Attend College Share Stories, Experiences
— Last updated: Jan 28, 2024