BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//UC Irvine Donald Bren School of Information &amp; Computer Sciences - ECPv6.3.4//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:UC Irvine Donald Bren School of Information &amp; Computer Sciences
X-ORIGINAL-URL:https://ics.uci.edu
X-WR-CALDESC:Events for UC Irvine Donald Bren School of Information &amp; Computer Sciences
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/Los_Angeles
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20260308T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20261101T090000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20260507T160000
DTEND;TZID=America/Los_Angeles:20260507T170000
DTSTAMP:20260613T121917
CREATED:20260305T171145Z
LAST-MODIFIED:20260417T202907Z
UID:28157-1778169600-1778173200@ics.uci.edu
SUMMARY:From Coevolution Signals to Mutation Effects: Statistically Calibrated Protein Contact Maps and Fitness Prediction
DESCRIPTION:Protein sequence data offer a massive natural experiment: across evolution\, residue positions co-vary in ways that encode 3D structure\, illuminate key evolutionary constraints\, and reveal compensatory mutation mechanisms. Yet many widely used coevolution methods remain primarily algorithmic–powerful in practice\, but often lacking calibrated uncertainty and rigorous theoretical guarantees. In this talk\, we introduce a statistically grounded toolkit that transforms multiple sequence alignments (MSAs)–high-dimensional\, dependent categorical data–into (i) principled contact maps with error control and (ii) quantitatively reliable predictions of mutation effects. First\, we recast contact prediction as hypothesis testing for conditional dependence in high-dimensional categorical data. Using one-hot encoded MSAs\, we construct a partial-correlation-style graph and propose a new spectrum-based test statistic that enables statistically calibrated contact discovery. The framework further identifies the specific amino-acid combinations driving each detected interaction\, providing a new layer of interpretability for coevolution signals. Next\, we develop a Potts-model framework for mutation-effect modeling via node-wise high-dimensional multinomial regression. Our approach enforces sparsity both across residue pairs and across amino-acid types through sparse-group regularization\, and it incorporates structural information by weighting penalties across site pairs. We establish sharp L2 convergence rates for the estimated Potts parameters\, which in turn yield trustworthy estimates of evolutionary energies and mutation-induced energy changes. Across multiple protein families\, our methods improve mutation fitness prediction when benchmarked against high-throughput mutagenesis experiments.
URL:https://ics.uci.edu/event/from-coevolution-signals-to-mutation-effects-statistically-calibrated-protein-contact-maps-and-fitness-prediction/
LOCATION:Donald Bren Hall\, Irvine\, CA\, 92697\, United States
ATTACH;FMTTYPE=image/jpeg:https://ics.uci.edu/wp-content/uploads/2026/03/Wen-Zhou-resize.jpg
END:VEVENT
END:VCALENDAR