Yajun Mei

Scroll

Professor of Biostatistics

Yajun Mei is a Professor of Biostatistics at NYU/GPH, starting from July 1, 2024. He received the B.S. degree in Mathematics from Peking University, Beijing, China, in 1996, and the Ph.D. degree in Mathematics with a minor in Electrical Engineering from the California Institute of Technology, Pasadena, CA, USA, in 2003. He was a Postdoc in Biostatistics in the renowned Fred Hutch Cancer Center in Seattle, WA during 2003 and 2005. Prior to joining NYU, Dr. Mei was an Assistant/Associate/Full Professor in H. Milton Stewart School of Industrial and Systems Engineering at the Georgia Institute of Technology, Atlanta, GA for 18 years from 2006 to 2024, and had been a co-director of Biostatistics, Epidemiology, and Study Design (BERD) of Georgia CTSA since 2018.

Dr. Mei’s research interests are statistics, machine learning, and data science, and their applications in biomedical science and public health, particularly, streaming data analysis, sequential decision/design, change-point problems, precision/personalized medicine, hot-spots detection for infectious diseases, longitudinal data analysis, bioinformatics, and clinical trials. His work has received several recognitions including Abraham Wald Prizes in Sequential Analysis in both 2009 and 2024, NSF CAREER Award in 2010, an elected Fellow of American Statistical Association (ASA) in 2023, and multiple best paper awards.

Education

BS, Mathematics, Peking University

PhD, Mathematics, California Institute of Technology

Honors and awards

Fellow of American Statistical Association (2023)

Star Research Achievement Award, 2021 Virtual Critical Care Congress (2021)

Best Paper Competition Award, Quality, Statistics & Reliability of INFORMS (2020)

Bronze Snapshot Award, Society of Critical Care Medicine (2019)

NSF Career Award

Thank a Teacher Certificate, Center for Teaching and Learning (2011201220162020202120222023)

Abraham Wald Prize (2009)

Best Paper Award, 11th International Conference on Information Fusion (2008)

New Researcher Fellow, Statistical and Applied Mathematical Sciences Institute (2005)

Fred Hutchinson SPAC Travel Award to attend 2005 Joint Statistical Meetings, Minneapolis, MN (2005)

Travel Award to 8th New Researchers Conference, Minneapolis, MN (2005)

Travel Award to IEEE International Symposium on Information Theory, Chicago, IL (2004)

Travel Award to IPAM workshop on inverse problem, UCLA, Los Angeles, CA (2003)

Fred Hutchinson SPAC Course Scholarship (2003)

Travel Award to the SAMSI workshop on inverse problem, Research Triangular Park, NC (2002)

Publications

Quickest Change Detection and Kullback-Leibler Divergence for Two-State Hidden Markov Models

Fuh, C. D., & Mei, Y. (n.d.).

Publication year

2015

Journal title

IEEE Transactions on Signal Processing

Volume

Issue

Page(s)

4866-4878

10.1109/TSP.2015.2447506

Abstract

Abstract

In this paper, the quickest change detection problem is studied in two-state hidden Markov models (HMM), where the vector parameter θ of the HMM changes from θ0 to θ1 at some unknown time, and one wants to detect the true change as quickly as possible while controlling the false alarm rate. It turns out that the generalized likelihood ratio (GLR) scheme, while theoretically straightforward, is generally computationally infeasible for the HMM. To develop efficient but computationally simple schemes for the HMM, we first discuss a subtlety in the recursive form of the generalized likelihood ratio (GLR) scheme for the HMM. Then we show that the recursive CUSUM scheme proposed in Fuh (Ann. Statist., 2003) can be regarded as a quasi-GLR scheme for pseudo post-change hypotheses with certain dependence structure between pre- and postchange observations. Next, we extend the quasi-GLR idea to propose recursive score schemes in the scenario when the postchange parameter θ1 of the HMM involves a real-valued nuisance parameter. Finally, the Kullback-Leibler (KL) divergence plays an essential role in the quickest change detection problem and many other fields, however it is rather challenging to numerically compute it in HMMs. Here we develop a non-Monte Carlo method that computes the KL divergence of two-state HMMs via the underlying invariant probability measure, which is characterized by the Fredholm integral equation. Numerical study demonstrates an unusual property of the KL divergence for HMM that implies the severe effects of misspecifying the postchange parameter for the HMM.

Discussion on "Change-Points: From Sequential Detection to Biology and Back" by David O. Siegmund

Mei, Y. (n.d.).

Publication year

2013

Journal title

Sequential Analysis

Volume

Issue

Page(s)

32-35

10.1080/07474946.2013.751844

Abstract

Abstract

In his interesting paper, Professor Siegmund illustrates that the problem formulations and methodologies are generally transferable between off-line and on-line settings of change-point problems. In our discussion of his paper, we echo his thoughts with our own experiences.

Quantization effect on the log-likelihood ratio and its application to decentralized sequential detection

Wang, Y., & Mei, Y. (n.d.).

Publication year

2013

Journal title

IEEE Transactions on Signal Processing

Volume

Issue

Page(s)

1536-1543

10.1109/TSP.2012.2237170

Abstract

Abstract

It is well known that quantization cannot increase the Kullback-Leibler divergence which can be thought of as the expected value or first moment of the log-likelihood ratio. In this paper, we investigate the quantization effects on the second moment of the log-likelihood ratio. It is shown via the convex domination technique that quantization may result in an increase in the case of the second moment, but the increase is bounded above by 2/e. The result is then applied to decentralized sequential detection problems not only to provide simpler sufficient conditions for asymptotic optimality theories in the simplest models, but also to shed new light on more complicated models. In addition, some brief remarks on other higher-order moments of the log-likelihood ratio are also provided.

A multistage procedure for decentralized sequential multi-hypothesis testing problems

Wang, Y., & Mei, Y. (n.d.).

Publication year

2012

Journal title

Sequential Analysis

Volume

Issue

Page(s)

505-527

10.1080/07474946.2012.719439

Abstract

Abstract

We studied the problem of sequentially testing M ≥ 2 hypotheses with a decentralized sensor network system. In such a system, the local sensors observe raw data and then send quantized observations to a fusion center, which makes a final decision regarding hypothesis is true. Motivated by the two-stage tests in Wang and Mei (2011), we propose a multistage decentralized sequential test that provides multiple opportunities for the local sensors to adjust to the optimal local quantizers. It is demonstrated that when the hypothesis testing problem is asymmetric, the multistage test is second-order asymptotically optimal. Even though this result constitutes an interesting theoretical improvement over twostage tests that can enjoy only first-order asymptotic optimality, the corresponding practical merits seem to be only marginal. Indeed, performance gains over two-stage procedures with carefully selected thresholds are small.

Asymptotic optimality theory for decentralized sequential multihypothesis testing problems

Wang, Y., & Mei, Y. (n.d.).

Publication year

2011

Journal title

IEEE Transactions on Information Theory

Volume

Issue

Page(s)

7068-7083

10.1109/TIT.2011.2165808

Abstract

Abstract

The Bayesian formulation of sequentially testing M ≥ 3 hypotheses is studied in the context of a decentralized sensor network system. In such a system, local sensors observe raw observations and send quantized sensor messages to a fusion center which makes a final decision when stopping taking observations. Asymptotically optimal decentralized sequential tests are developed from a class of "two-stage" tests that allows the sensor network system to make a preliminary decision in the first stage and then optimize each local sensor quantizer accordingly in the second stage. It is shown that the optimal local quantizer at each local sensor in the second stage can be defined as a maximin quantizer which turns out to be a randomization of at most M-1 unambiguous likelihood quantizers (ULQ). We first present in detail our results for the system with a single sensor and binary sensor messages, and then extend to more general cases involving any finite alphabet sensor messages, multiple sensors, or composite hypotheses.

Early detection of a change in poisson rate after accounting for population size effects

Mei, Y., Won Han, S., & Tsui, K. L. (n.d.).

Publication year

2011

Journal title

Statistica Sinica

Volume

Issue

Page(s)

597-624

10.5705/ss.2011.027a

Abstract

Abstract

Motivated by applications in bio and syndromic surveillance, this article is concerned with the problem of detecting a change in the mean of Poisson distributions after taking into account the effects of population size. The family of generalized likelihood ratio (GLR) schemes is proposed and its asymptotic optimality properties are established under the classical asymptotic setting. However, numerical simulation studies illustrate that the GLR schemes are at times not as efficient as two families of ad-hoc schemes based on either the weighted likelihood ratios or the adaptive threshold method that adjust the effects of population sizes. To explain this, a further asymptotic optimality analysis is developed under a new asymptotic setting that is more suitable to our finite-sample numerical simulations. In addition, we extend our approaches to a general setting with arbitrary probability distributions, as well as to the continuous-time setting involving the multiplicative intensity models for Poisson processes, but further research is needed.

Discussion on "Quickest detection problems: Fifty years later" by Albert N. Shiryaev

Mei, Y. (n.d.).

Publication year

2010

Journal title

Sequential Analysis

Volume

Issue

Page(s)

410-414

10.1080/07474946.2010.520609

Abstract

Abstract

In his interesting article Professor Shiryaev reviewed how he got motivated from real-applications of target detection in radar systems to develop sequential changepoint detection theory and also described different approaches to formulate mathematical problems. In our discussion of his article we focus on a couple of new real-world applications of sequential change-point detection and some new challenges as well.

Efficient scalable schemes for monitoring a large number of data streams

Mei, Y. (n.d.).

Publication year

2010

Journal title

Biometrika

Volume

Issue

Page(s)

419-433

10.1093/biomet/asq010

Abstract

Abstract

The sequential changepoint detection problem is studied in the context of global online monitoring of a large number of independent data streams. We are interested in detecting an occurring event as soon as possible, but we do not know when the event will occur, nor do we know which subset of data streams will be affected by the event. A family of scalable schemes is proposed based on the sum of the local cumulative sum, cusum, statistics from each individual data stream, and is shown to asymptotically minimize the detection delays for each and every possible combination of affected data streams, subject to the global false alarm constraint. The usefulness and limitations of our asymptotic optimality results are illustrated by numerical simulations and heuristic arguments. The Appendices contain a probabilistic result on the first epoch to simultaneous record values for multiple independent random walks.

Linear-mixed effects models for feature selection in high-dimensional NMR spectra

Mei, Y., Kim, S. B., & Tsui, K. L. (n.d.).

Publication year

2009

Journal title

Expert Systems with Applications

Volume

Issue

Page(s)

4703-4708

10.1016/j.eswa.2008.06.032

Abstract

Abstract

Feature selection in metabolomics can identify important metabolite features that play a significant role in discriminating between various conditions among samples. In this paper, we propose an efficient feature selection method for high-resolution nuclear magnetic resonance (NMR) spectra obtained from time-course experiments. Our proposed approach combines linear-mixed effects (LME) models with a multiple testing procedure based on a false discovery rate. The proposed LME approach is illustrated using NMR spectra with 574 metabolite features obtained for an experiment to examine metabolic changes in response to sulfur amino acid intake. The experimental results showed that classification models constructed with the features selected by the proposed approach resulted in lower rates of misclassification than those models with full features. Furthermore, we compared the LME approach with the two-sample t-test approach that oversimplifies the time-course factor.

A comparison of methods for determining HIV viral set point

Mei, Y., Wang, L., & Holte, S. E. (n.d.).

Publication year

2008

Journal title

Statistics in Medicine

Volume

Issue

Page(s)

121-139

10.1002/sim.3038

Abstract

Abstract

During a course of human immunodeficiency virus (HIV-1) infection, the viral load usually increases sharply to a peak following infection and then drops rapidly to a steady state, where it remains until progression to AIDS. This steady state is often referred to as the viral set point. It is believed that the HIV viral set point results from an equilibrium between the HIV virus and immune response and is an important indicator of AIDS disease progression. In this paper, we analyze a real data set of viral loads measured before antiretroviral therapy is initiated, and propose two-phase regression models to utilize all available data to estimate the viral set point. The advantages of the proposed methods are illustrated by comparing them with two empirical methods, and the reason behind the improvement is also studied. Our results illustrate that for our data set, the viral load data are highly correlated and it is cost effective to estimate the viral set point based on one or two measurements obtained between 5 and 12 months after HIV infection. The utility and limitations of this recommendation will be discussed.

Asymptotic optimality theory for decentralized sequential hypothesis testing in sensor networks

Mei, Y. (n.d.).

Publication year

2008

Journal title

IEEE Transactions on Information Theory

Volume

Issue

Page(s)

2072-2089

10.1109/TIT.2008.920217

Abstract

Abstract

The decentralized sequential hypothesis testing problem is studied in sensor networks, where a set of sensors receive independent observations and send summary messages to the fusion center, which makes a final decision. In the scenario where the sensors have full access to their past observations, the first asymptotically Bayes sequential test is developed having the same asymptotic performance as the optimal centralized test that has access to all sensor observations. Next, in the scenario where the sensors do not have full access to their past observations, a simple but asymptotically Bayes sequential tests is developed, in which sensor message functions are what we call tandem quantizer, where each sensor only uses two different sensor quantizers with at most one switch between these two possibilities. Moreover, a new minimax formulation of optimal stationary sensor quantizers is proposed and is studied in detail in the case of additive Gaussian sensor noise. Finally, our results show that in the simplest models, feedback from the fusion center does not improve asymptotic performance in the scenario with full local memory, however, even a one-shot, one-bit feedback can significantly improve performance in the case of limited local memory.

Is average run length to false alarm always an informative criterion?

Mei, Y. (n.d.).

Publication year

2008

Journal title

Sequential Analysis

Volume

Issue

Page(s)

354-376

10.1080/07474940802445790

Abstract

Abstract

Apart from Bayesian approaches, the average run length (ARL) to false alarm has always been seen as the natural performance criterion for quantifying the propensity of a detection scheme to make false alarms, and no researchers seem to have questioned this on grounds that it does not always apply. In this article, we show that in the change-point problem with mixture prechange models, detection schemes with finite detection delays can have infinite ARLs to false alarm. We also discuss the implication of our results on the change-point problem with either exchangeable prechange models or hidden Markov models. Alternative minimax formulations with different false alarm criteria are proposed.

Sample size calculation for the van Elteren test adjusting for ties

Zhao, Y. D., Rahardja, D., & Mei, Y. (n.d.).

Publication year

2008

Journal title

Journal of Biopharmaceutical Statistics

Volume

Issue

Page(s)

1112-1119

10.1080/10543400802369020

Abstract

Abstract

In this article we study sample size calculation methods for the asymptotic van Elteren test. Because the existing methods are only applicable to continuous data without ties, in this article we develop a new method that can be used on ordinal data. The new method has a closed form formula and is very easy to calculate. The new sample size formula performs very well because our simulations show that the corresponding actual powers are close to the nominal powers.

A discussion on "Detection of intrusions in information systems by sequential change-point methods" by Tartakovsky, Rozovskii, Blažek, and Kim

Mei, Y. (n.d.).

Publication year

2006

Journal title

Statistical Methodology

Volume

Issue

Page(s)

304-306

10.1016/j.stamet.2005.06.009

Comments on "a note on optimal detection of a change in distribution," by benjamin Yakir

Mei, Y. (n.d.).

Publication year

2006

Journal title

Annals of Statistics

Volume

Issue

Page(s)

1570-1576

10.1214/009053606000000362

Abstract

Abstract

The purpose of this note is to show that in a widely cited paper by Yakir [Ann. Statist. 25 (1997) 2117-2126], the proof that the so-called modified Shiryayev-Roberts procedure is exactly optimal is incorrect. We also clarify the issues involved by both mathematical arguments and a simulation study. The correctness of the theorem remains in doubt.

Sequential change-point detection when unknown parameters are present in the pre-change distribution

Mei, Y. (n.d.).

Publication year

2006

Journal title

Annals of Statistics

Volume

Issue

Page(s)

92-122

10.1214/009053605000000859

Abstract

Abstract

In the sequential change-point detection literature, most research specifies a required frequency of false alarms at a given pre-change distribution f θ and tries to minimize the detection delay for every possible post-change distribution g λ. In this paper, motivated by a number of practical examples, we first consider the reverse question by specifying a required detection delay at a given post-change distribution and trying to minimize the frequency of false alarms for every possible pre-change distribution f θ. We present asymptotically optimal procedures for one-parameter exponential families. Next, we develop a general theory for change-point problems when both the prechange distribution f θ and the post-change distribution g λ involve unknown parameters. We also apply our approach to the special case of detecting shifts in the mean of independent normal observations.

Suboptimal properties of page's CUSUM and shiryayev-roberts procedures in change-point problems with dependent observations

Mei, Y. (n.d.).

Publication year

2006

Journal title

Statistica Sinica

Volume

Issue

Page(s)

883-897

Abstract

Abstract

We construct a simple counterexample to the conjectures of Pollak (1985) and Yakir, Krieger and Pollak (1999), which state that Page's CUSUM procedure and the Shiryayev-Roberts procedure are asymptotically minimax optimal for dependent observations. Moreover, our example shows that the close relationship between open-ended tests and change-point detection procedures no longer holds for dependent observations. As a consequence, the standard approach which constructs change-point detection procedures based on asymptotically optimal openended tests does not in general provide asymptotically optimal procedures for dependent observations.

Information bounds and quickest change detection in decentralized decision systems

Mei, Y. (n.d.).

Publication year

2005

Journal title

IEEE Transactions on Information Theory

Volume

Issue

Page(s)

2669-2681

10.1109/TIT.2005.850159

Abstract

Abstract

The quickest change detection problem is studied in decentralized decision systems, where a set of sensors receive independent observations and send summary messages to the fusion center, which makes a final decision. In the system where the sensors do not have access to their past observations, the previously conjectured asymptotic optimality of a procedure with a monotone likelihood ratio quantizer (MLRQ) is proved. In the case of additive Gaussian sensor noise, if the signal-to-noise ratios (SNR) at some sensors are sufficiently high, this procedure can perform as well as the optimal centralized procedure that has access to all the sensor observations. Even if all SNRs are low, its detection delay will be at most π/2 - 1 ≈ 57% larger than that of the optimal centralized procedure. Next, in the system where the sensors have full access to their past observations, the first asymptotically optimal procedure in the literature is developed. Surprisingly, the procedure has the same asymptotic performance as the optimal centralized procedure, although it may perform poorly in some practical situations because of slow asymptotic convergence. Finally, it is shown that neither past message information nor the feedback from the fusion center improves the asymptotic performance in the simplest model.

Yajun Mei

Yajun Mei

Professor of Biostatistics

Professional overview

Education

Honors and awards

Publications

Publications

Quickest Change Detection and Kullback-Leibler Divergence for Two-State Hidden Markov Models

Publication year

Journal title

Volume

Issue

Page(s)

Discussion on "Change-Points: From Sequential Detection to Biology and Back" by David O. Siegmund

Publication year

Journal title

Volume

Issue

Page(s)

Quantization effect on the log-likelihood ratio and its application to decentralized sequential detection

Publication year

Journal title

Volume

Issue

Page(s)

A multistage procedure for decentralized sequential multi-hypothesis testing problems

Publication year

Journal title

Volume

Issue

Page(s)

Asymptotic optimality theory for decentralized sequential multihypothesis testing problems

Publication year

Journal title

Volume

Issue

Page(s)

Early detection of a change in poisson rate after accounting for population size effects

Publication year

Journal title

Volume

Issue

Page(s)

Discussion on "Quickest detection problems: Fifty years later" by Albert N. Shiryaev

Publication year

Journal title

Volume

Issue

Page(s)

Efficient scalable schemes for monitoring a large number of data streams

Publication year

Journal title

Volume

Issue

Page(s)

Linear-mixed effects models for feature selection in high-dimensional NMR spectra

Publication year

Journal title

Volume

Issue

Page(s)

A comparison of methods for determining HIV viral set point

Publication year

Journal title

Volume

Issue

Page(s)

Asymptotic optimality theory for decentralized sequential hypothesis testing in sensor networks

Publication year

Journal title

Volume

Issue

Page(s)

Is average run length to false alarm always an informative criterion?

Publication year

Journal title

Volume

Issue

Page(s)