Yajun Mei
Professor of Biostatistics
-
Professional overview
-
Yajun Mei is a Professor of Biostatistics at the NYU School of Global Public Health (GPH), effective July 1, 2024. He received his B.S. degree in Mathematics from Peking University, Beijing, China, in 1996, and his Ph.D. degree in Mathematics with a minor in Electrical Engineering from the California Institute of Technology, Pasadena, CA, USA, in 2003. He was a postdoctoral fellow in biostatistics at the Fred Hutchinson Cancer Center in Seattle, WA, from 2003 to 2005. Prior to joining NYU, Dr. Mei was an Assistant, Associate, and then Full Professor in the H. Milton Stewart School of Industrial and Systems Engineering at the Georgia Institute of Technology, Atlanta, GA, for 18 years, from 2006 to 2024, and served as a co-director of Biostatistics, Epidemiology, and Research Design (BERD) of the Georgia CTSA beginning in 2018.
Dr. Mei's research interests are statistics, machine learning, and data science, and their applications in biomedical science and public health, particularly streaming data analysis, sequential decision-making and design, change-point problems, precision/personalized medicine, hot-spot detection for infectious diseases, longitudinal data analysis, bioinformatics, and clinical trials. His work has received several recognitions, including the Abraham Wald Prize in Sequential Analysis in both 2009 and 2024, an NSF CAREER Award in 2010, election as a Fellow of the American Statistical Association (ASA) in 2023, and multiple best paper awards.
-
Education
-
BS, Mathematics, Peking University
PhD, Mathematics, California Institute of Technology
-
Honors and awards
-
Fellow of American Statistical Association (2023)
Star Research Achievement Award, 2021 Virtual Critical Care Congress (2021)
Best Paper Competition Award, Quality, Statistics & Reliability of INFORMS (2020)
Bronze Snapshot Award, Society of Critical Care Medicine (2019)
NSF CAREER Award (2010)
Thank a Teacher Certificate, Center for Teaching and Learning (2011, 2012, 2016, 2020, 2021, 2022, 2023)
Abraham Wald Prize (2009)
Best Paper Award, 11th International Conference on Information Fusion (2008)
New Researcher Fellow, Statistical and Applied Mathematical Sciences Institute (2005)
Fred Hutchinson SPAC Travel Award to attend 2005 Joint Statistical Meetings, Minneapolis, MN (2005)
Travel Award to 8th New Researchers Conference, Minneapolis, MN (2005)
Travel Award to IEEE International Symposium on Information Theory, Chicago, IL (2004)
Travel Award to IPAM workshop on inverse problems, UCLA, Los Angeles, CA (2003)
Fred Hutchinson SPAC Course Scholarship (2003)
Travel Award to the SAMSI workshop on inverse problems, Research Triangle Park, NC (2002)
-
Publications
-
Directional false discovery rate control in large-scale multiple comparisons
Liang, W., Xiang, D., Mei, Y., & Li, W. (2024). Journal of Applied Statistics, 51(15), 3195-3214.
Abstract: The advance of high-throughput biomedical technology makes it possible to access massive measurements of gene expression levels. An important statistical issue is identifying both under-expressed and over-expressed genes for a disease. Most existing multiple-testing procedures focus on selecting only the non-null or significant genes without further identifying their expression type. Only limited methods are designed for the directional problem, and yet they fail to separately control the numbers of falsely discovered over-expressed and under-expressed genes with only a unified index combining all the false discoveries. In this paper, based on a three-classification multiple testing framework, we propose a practical data-driven procedure to control separately the two directions of false discoveries. The proposed procedure is theoretically valid and optimal in the sense that it maximizes the expected number of true discoveries while controlling the false discovery rates for under-expressed and over-expressed genes simultaneously. The procedure allows different nominal levels for the two directions, exhibiting high flexibility in practice. Extensive numerical results and analysis of two large-scale genomic datasets show the effectiveness of our procedure.
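
As a rough illustration of the directional setup (not the authors' optimal data-driven procedure), the sketch below runs a Benjamini-Hochberg step separately on each tail of a vector of z-scores, with its own nominal level per direction; the Gaussian z-score input and the two levels are assumptions for the toy example.

```python
import numpy as np
from scipy.stats import norm

def bh(pvals, alpha):
    """Benjamini-Hochberg step-up; returns a boolean rejection mask."""
    m = len(pvals)
    order = np.argsort(pvals)
    passed = pvals[order] <= alpha * np.arange(1, m + 1) / m
    k = np.max(np.where(passed)[0]) + 1 if passed.any() else 0
    reject = np.zeros(m, dtype=bool)
    reject[order[:k]] = True
    return reject

def directional_bh(z, alpha_plus=0.05, alpha_minus=0.05):
    """Run BH separately on each tail, with its own level per direction."""
    return bh(norm.sf(z), alpha_plus), bh(norm.cdf(z), alpha_minus)

# Toy data: 1000 null genes plus 20 shifted up and 20 shifted down.
rng = np.random.default_rng(0)
z = np.concatenate([rng.normal(size=1000),
                    rng.normal(4, 1, 20),     # over-expressed
                    rng.normal(-4, 1, 20)])   # under-expressed
over, under = directional_bh(z)
print(over.sum(), under.sum())
```

Using two separate levels mirrors the flexibility the abstract describes: the two directions need not share a single nominal level.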

Pharmacologic Venous Thromboembolism Prophylaxis in Patients with Nontraumatic Subarachnoid Hemorrhage Requiring an External Ventricular Drain
Ukpabi, C., Sadan, O., Shi, Y., Greene, K. N., Samuels, O., Mathew, S., Joy, J., Mei, Y., & Asbury, W. (2024). Neurocritical Care.
Abstract: Background: Optimal pharmacologic thromboprophylaxis dosing is not well described in patients with subarachnoid hemorrhage (SAH) with an external ventricular drain (EVD). Our patients with SAH with an EVD who receive prophylactic enoxaparin are routinely monitored using timed anti-Xa levels. Our primary study goal was to determine the frequency of venous thromboembolism (VTE) and secondary intracranial hemorrhage (ICH) for this population of patients who received pharmacologic prophylaxis with enoxaparin or unfractionated heparin (UFH). Methods: A retrospective chart review was performed for all patients with SAH admitted to the neurocritical care unit at Emory University Hospital between 2012 and 2017. All patients with SAH who required an EVD were included. Results: Of 1,351 patients screened, 868 required an EVD. Of these 868 patients, 627 received enoxaparin, 114 received UFH, and 127 did not receive pharmacologic prophylaxis. VTE occurred in 7.5% of patients in the enoxaparin group, 4.4% in the UFH group (p = 0.32), and 3.2% in the no VTE prophylaxis group (p = 0.08). Secondary ICH occurred in 3.83% of patients in the enoxaparin group, 3.51% in the UFH group (p = 1), and 3.94% in the no VTE prophylaxis group (p = 0.53). As steady-state anti-Xa levels increased from 0.1 units/mL to > 0.3 units/mL, there was a trend toward a lower incidence of VTE. However, no correlation was noted between rising anti-Xa levels and an increased incidence of secondary ICH. When compared, neither enoxaparin nor UFH use was associated with a significantly reduced incidence of VTE or an increased incidence of ICH. Conclusions: In this retrospective study of patients with nontraumatic SAH with an EVD who received enoxaparin or UFH VTE prophylaxis or no VTE prophylaxis, there was no statistically significant difference in the incidence of VTE or secondary ICH. For patients receiving prophylactic enoxaparin, achieving higher steady-state target anti-Xa levels may be associated with a lower incidence of VTE without increasing the risk of secondary ICH.

Active learning-based multistage sequential decision-making model with application on common bile duct stone evaluation
Tian, H., Cohen, R. Z., Zhang, C., & Mei, Y. (2023). Journal of Applied Statistics, 50(14), 2951-2969.
Abstract: Multistage sequential decision-making occurs in many real-world applications such as healthcare diagnosis and treatment. One concrete example is when doctors need to decide which kind of information to collect from subjects so as to make a good medical decision cost-effectively. In this paper, an active learning-based method is developed to model the doctors' decision-making process that actively collects necessary information from each subject in a sequential manner. The effectiveness of the proposed model, especially its two-stage version, is validated on both simulation studies and a case study of common bile duct stone evaluation for pediatric patients.

Adaptive resources allocation CUSUM for binomial count data monitoring with application to COVID-19 hotspot detection
Hu, J., Mei, Y., Holte, S., & Yan, H. (2023). Journal of Applied Statistics, 50(14), 2889-2913.
Abstract: In this paper, we present an efficient statistical method (denoted as 'Adaptive Resources Allocation CUSUM') to robustly and efficiently detect the hotspot with limited sampling resources. Our main idea is to combine the multi-arm bandit (MAB) and change-point detection methods to balance the exploration and exploitation of resource allocation for hotspot detection. Further, a Bayesian weighted update is used to update the posterior distribution of the infection rate. Then, the upper confidence bound (UCB) is used for resource allocation and planning. Finally, CUSUM monitoring statistics are used to detect the change point as well as the change location. For performance evaluation, we compare the performance of the proposed method with several benchmark methods in the literature and show that the proposed algorithm is able to achieve a lower detection delay and higher detection precision. Finally, this method is applied to hotspot detection in a real case study of county-level daily positive COVID-19 cases in Washington State (WA) and demonstrates its effectiveness with very limited distributed samples.
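
A minimal sketch of the ingredients named in the abstract: Beta-Binomial posterior updates, a UCB rule to decide which region to sample, and a Binomial CUSUM on the sampled counts. The rates, threshold, and the specific UCB bonus below are illustrative assumptions, not the paper's tuned algorithm.

```python
import numpy as np

rng = np.random.default_rng(1)

K, n = 20, 50                 # regions; tests available in the sampled region
p0, p1 = 0.05, 0.15           # in-control and hotspot infection rates
h, nu, hot = 5.0, 50, 7       # CUSUM threshold, change day, hotspot region

alpha = np.ones(K); beta = np.ones(K)   # Beta(1,1) posterior per region
cusum = np.zeros(K)
for t in range(1, 301):
    # UCB rule on the posterior: mean plus an exploration bonus
    mean = alpha / (alpha + beta)
    ucb = mean + np.sqrt(2 * np.log(t) / (alpha + beta))
    k = int(np.argmax(ucb))             # the one region sampled today

    x = rng.binomial(n, p1 if (k == hot and t > nu) else p0)
    alpha[k] += x; beta[k] += n - x     # Bayesian update (ignores the change)

    # Binomial CUSUM of the p0 -> p1 log-likelihood ratio for the sampled region
    llr = x * np.log(p1 / p0) + (n - x) * np.log((1 - p1) / (1 - p0))
    cusum[k] = max(0.0, cusum[k] + llr)
    if cusum[k] > h:
        print(f"day {t}: hotspot alarm in region {k}")
        break
```

The UCB bonus keeps unexplored regions in rotation, while the CUSUM accumulates evidence only in regions actually sampled, which is the exploration-exploitation trade-off the abstract describes.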

Asymptotic optimality theory for active quickest detection with unknown postchange parameters
Xu, Q., & Mei, Y. (2023). Sequential Analysis, 42(2), 150-181.
Abstract: The active quickest detection problem with unknown postchange parameters is studied under the sampling control constraint, where there are p local streams in a system, but one is able to take observations from one and only one of these p local streams at each time instant. The objective is to raise a correct alarm as quickly as possible once the change occurs subject to both false alarm and sampling control constraints. Here we assume that exactly one of the p local streams is affected, and the postchange distribution involves unknown parameters. In this context, we propose an efficient greedy cyclic sampling-based quickest detection algorithm and show that our proposed algorithm is asymptotically optimal in the sense of minimizing the detection delay under both false alarm and sampling control constraints. Numerical studies are conducted to show the effectiveness and applicability of the proposed algorithm.
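
The greedy cyclic sampling idea can be caricatured as follows: sample streams in a cycle while the local statistic sits at zero, stay on a stream while it accumulates evidence, and stop at a threshold crossing. For simplicity this sketch plugs in a fixed working value for the unknown postchange mean, whereas the paper handles unknown parameters adaptively; the stream count, shift, and threshold are made-up.

```python
import numpy as np

rng = np.random.default_rng(2)

p, h = 5, 10.0            # number of local streams; alarm threshold
nu, affected = 500, 3     # change time and affected stream (unknown to us)
theta = 0.5               # working value standing in for the unknown parameter

w = np.zeros(p)           # local CUSUM statistics
k = 0                     # the single stream being sampled right now
for t in range(1, 2001):
    mu = 1.0 if (k == affected and t > nu) else 0.0
    x = rng.normal(mu, 1.0)                            # one observation per step
    w[k] = max(0.0, w[k] + theta * x - theta**2 / 2)   # CUSUM update
    if w[k] > h:
        print(f"t={t}: alarm, declare stream {k} as changed")
        break
    if w[k] == 0.0:       # greedy rule: move to the next stream in the cycle
        k = (k + 1) % p   # only when the current evidence has died out
```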

Bandit Change-Point Detection for Real-Time Monitoring High-Dimensional Data Under Sampling Control
Zhang, W., & Mei, Y. (2023). Technometrics, 65(1), 33-43.
Abstract: In many real-world problems of real-time monitoring high-dimensional streaming data, one wants to detect an undesired event or change quickly once it occurs, but under the sampling control constraint in the sense that one might only be able to observe or use selected components' data for decision-making per time step in resource-constrained environments. In this article, we propose to incorporate multi-armed bandit approaches into sequential change-point detection to develop an efficient bandit change-point detection algorithm based on the limiting Bayesian approach to incorporate prior knowledge of potential changes. Our proposed algorithm, termed Thompson-Sampling-Shiryaev-Roberts-Pollak (TSSRP), consists of two policies per time step: the adaptive sampling policy applies the Thompson Sampling algorithm to balance between exploration for acquiring long-term knowledge and exploitation for immediate reward gain, and the statistical decision policy fuses the local Shiryaev–Roberts–Pollak statistics to determine whether to raise a global alarm by sum shrinkage techniques. Extensive numerical simulations and case studies demonstrate the statistical and computational efficiency of our proposed TSSRP algorithm.
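
A toy rendering of the two per-step policies (not the exact TSSRP updates): randomized scores pick which q components to observe, local Shiryaev-Roberts statistics are updated on the observed components, and a sum-shrinkage fusion decides the global alarm. All constants are illustrative.

```python
import numpy as np

rng = np.random.default_rng(3)

K, q = 10, 2                  # data streams; streams observed per time step
mu1, b, h = 1.0, 1.0, 2000.0  # assumed shift, shrinkage level, global threshold
nu, changed = 500, 3          # change time and affected stream (unknown to us)

R = np.zeros(K)               # local Shiryaev-Roberts statistics
for t in range(1, 3001):
    # Thompson-style randomization: perturb the statistics, observe the top q
    obs = np.argsort(R + rng.exponential(1.0, size=K))[-q:]
    for k in obs:
        x = rng.normal(mu1 if (k == changed and t > nu) else 0.0, 1.0)
        lam = np.exp(mu1 * x - mu1**2 / 2)   # N(0,1)-vs-N(mu1,1) likelihood ratio
        R[k] = (1.0 + R[k]) * lam            # Shiryaev-Roberts recursion
    # sum-shrinkage fusion: only streams with strong evidence contribute
    if np.sum(np.maximum(R - b, 0.0)) > h:
        print(f"t={t}: global alarm, suspect stream {int(np.argmax(R))}")
        break
```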

Cost-efficient fixed-width confidence intervals for the difference of two Bernoulli proportions
Erazo, I., Goldsman, D., & Mei, Y. (2023). Journal of Simulation.
Abstract: We study the properties of confidence intervals (CIs) for the difference of two Bernoulli distributions' success parameters, p1 - p2, in the case where the goal is to obtain a CI of a given half-width while minimising sampling costs when the observation costs may be different between the two distributions. We propose three different methods for constructing fixed-width CIs: (i) a two-stage sampling procedure, (ii) a sequential method that carries out sampling in batches, and (iii) a multistage "look-ahead" procedure. Under diverse scenarios, our proposed algorithms obtain significant cost savings versus their baseline counterparts. Furthermore, for the scenarios under study, our sequential-batches and "look-ahead" procedures approximately obtain the nominal coverage while meeting the desired width requirement. Our sequential-batching method is more efficient than the "look-ahead" method computationally, with average running times an order of magnitude faster over the scenarios tested. We illustrate our procedures on a case study comparing generic and brand-name drugs.
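
The sequential-batches idea can be sketched in a few lines: keep drawing batches from both populations until the plug-in Wald interval for p1 - p2 reaches the target half-width. The batch sizes and the Wald form are simplifying assumptions; the paper's procedures also decide which population to sample based on per-observation costs.

```python
import numpy as np
from scipy.stats import norm

def seq_batch_ci(sample1, sample2, d=0.05, conf=0.95, n0=50, batch=100):
    """Sample in batches until the Wald CI for p1 - p2 has half-width <= d.

    sample1/sample2 are callables returning arrays of 0/1 draws. A plain
    sequential-batching sketch under equal sampling costs.
    """
    z = norm.ppf(0.5 + conf / 2)
    x1, x2 = list(sample1(n0)), list(sample2(n0))
    while True:
        n1, n2 = len(x1), len(x2)
        p1, p2 = np.mean(x1), np.mean(x2)
        half = z * np.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
        if half <= d:
            return p1 - p2 - half, p1 - p2 + half, n1, n2
        x1.extend(sample1(batch)); x2.extend(sample2(batch))

rng = np.random.default_rng(4)
lo, hi, n1, n2 = seq_batch_ci(lambda m: rng.binomial(1, 0.30, m),
                              lambda m: rng.binomial(1, 0.25, m))
print(f"CI = ({lo:.3f}, {hi:.3f}) after n1={n1}, n2={n2} observations")
```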

CSSQ: a ChIP-seq signal quantifier pipeline
Kumar, A., Hu, M. Y., Mei, Y., & Fan, Y. (2023). Frontiers in Cell and Developmental Biology, 11.
Abstract: Chromatin immunoprecipitation followed by sequencing (ChIP-seq) has revolutionized the studies of epigenomes, and the massive increase in ChIP-seq datasets calls for robust and user-friendly computational tools for quantitative ChIP-seq. Quantitative ChIP-seq comparisons have been challenging due to noisiness and variations inherent to ChIP-seq and epigenomes. By employing innovative statistical approaches specially catered to ChIP-seq data distribution and sophisticated simulations along with extensive benchmarking studies, we developed and validated CSSQ as a nimble statistical analysis pipeline capable of differential binding analysis across ChIP-seq datasets with high confidence and sensitivity and low false discovery rate with any defined regions. CSSQ models ChIP-seq data as a finite mixture of Gaussians that faithfully reflects the ChIP-seq data distribution. By a combination of Anscombe transformation, k-means clustering, and estimated maximum normalization, CSSQ minimizes noise and bias from experimental variations. Further, CSSQ utilizes a non-parametric approach and incorporates comparisons under the null hypothesis by unaudited column permutation to perform robust statistical tests to account for fewer replicates of ChIP-seq datasets. In sum, we present CSSQ as a powerful statistical computational pipeline tailored for ChIP-seq data quantitation and a timely addition to the tool kits of differential binding analysis to decipher epigenomes.
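
Two of the building blocks mentioned above, the Anscombe variance-stabilizing transform and a permutation test across replicate columns, are easy to sketch; this is a generic illustration, not the CSSQ pipeline itself, and the replicate counts and Poisson rates are invented.

```python
import numpy as np

rng = np.random.default_rng(5)

def anscombe(x):
    """Variance-stabilizing transform for count-like ChIP-seq signal."""
    return 2.0 * np.sqrt(x + 3.0 / 8.0)

def permutation_pvalue(a, b, n_perm=2000):
    """Two-condition test for one region by permuting replicate labels."""
    obs = abs(a.mean() - b.mean())
    pooled = np.concatenate([a, b])
    hits = 0
    for _ in range(n_perm):
        perm = rng.permutation(pooled)
        hits += abs(perm[:len(a)].mean() - perm[len(a):].mean()) >= obs
    return (hits + 1) / (n_perm + 1)

# Toy region with 5 replicates per condition (counts, then transformed).
cond1 = anscombe(rng.poisson(20, size=5))
cond2 = anscombe(rng.poisson(35, size=5))
print(permutation_pvalue(cond1, cond2))
```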

Editorial to the special issue: modern streaming data analytics
Mei, Y., Bartroff, J., Chen, J., Fellouris, G., & Zhang, R. (2023). Journal of Applied Statistics, 50(14), 2857-2861.

Hot-spots detection in count data by Poisson assisted smooth sparse tensor decomposition
Zhao, Y., Huo, X., & Mei, Y. (2023). Journal of Applied Statistics, 50(14), 2999-3029.
Abstract: Count data occur widely in many bio-surveillance and healthcare applications, e.g. the numbers of new patients of different types of infectious diseases from different cities/counties/states repeatedly over time, say, daily/weekly/monthly. For this type of count data, one important task is the quick detection and localization of hot-spots in terms of unusual infectious rates so that we can respond appropriately. In this paper, we develop a method called Poisson assisted Smooth Sparse Tensor Decomposition (PoSSTenD), which not only detects when hot-spots occur but also localizes where hot-spots occur. The main idea of our proposed PoSSTenD method is articulated as follows. First, we represent the observed count data as a three-dimensional tensor including (1) a spatial dimension for location patterns, e.g. different cities/counties/states; (2) a temporal dimension for time patterns, e.g. daily/weekly/monthly; (3) a categorical dimension for different types of data sources, e.g. different types of diseases. Second, we fit this tensor into a Poisson regression model, and then we further decompose the infectious rate into two components: a smooth global trend and local hot-spots. Third, we detect when hot-spots occur by building a cumulative sum (CUSUM) control chart and localize where hot-spots occur by their LASSO-type sparse estimation. The usefulness of our proposed methodology is validated through numerical simulation studies and a real-world dataset, which records the annual number of 10 different infectious diseases from 1993 to 2018 for 49 mainland states in the United States.
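
The detection step can be illustrated with a plain Poisson CUSUM chart for a rate increase; the in-control and hot-spot rates and the threshold below are invented for the toy example, and the paper applies such a chart to quantities derived from the tensor model rather than to raw counts.

```python
import numpy as np

def poisson_cusum(counts, lam0, lam1):
    """One-sided CUSUM for a rate increase lam0 -> lam1 in Poisson counts."""
    # log-likelihood ratio of Poisson(lam1) vs Poisson(lam0) per observation
    llr = counts * np.log(lam1 / lam0) - (lam1 - lam0)
    path, w = [], 0.0
    for inc in llr:
        w = max(0.0, w + inc)   # reflect at zero, accumulate positive evidence
        path.append(w)
    return np.array(path)

rng = np.random.default_rng(6)
# Weekly counts: in-control rate 10, rising to 16 after week 100.
y = np.concatenate([rng.poisson(10, 100), rng.poisson(16, 50)])
path = poisson_cusum(y, lam0=10.0, lam1=16.0)
print("first crossing of h=8:", int(np.argmax(path > 8.0)))
```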

Repetitive Low-level Blast Exposure and Neurocognitive Effects in Army Ranger Mortarmen
Woodall, J. L., Sak, J. A., Cowdrick, K. R., Muñoz, B. M., McElrath, J. H., Trimpe, G. R., Mei, Y., Myhre, R. L., Rains, J. K., & Hutchinson, C. R. (2023). Military Medicine, 188(3), E771-E779.
Abstract: Introduction: Occupational exposure to repetitive, low-level blasts in military training and combat has been tied to subconcussive injury and poor health outcomes for service members. Most low-level blast studies to date have focused on explosive breaching and firing heavy weapon systems; however, there is limited research on the repetitive blast exposure and physiological effects that mortarmen experience when firing mortar weapon systems. Motivated by anecdotal symptoms of mortarmen, the purpose of this paper is to characterize this exposure and its resulting neurocognitive effects in order to provide preliminary findings and actionable recommendations to safeguard the health of mortarmen. Materials and Methods: In collaboration with the U.S. Army Rangers at Fort Benning, blast exposure, symptoms, and pupillary light reflex were measured during 3 days of firing 81 mm and 120 mm mortars in training. Blast exposure analysis included the examination of the blast overpressure (BOP) and cumulative exposure by mortarman position, as well as comparison to the 4 psi safety threshold. Pupillary light reflex responses were analyzed with linear mixed effects modeling. All neurocognitive results were compared between mortarmen (n=11) and controls (n=4) and cross-compared with blast exposure and blast history. Results: Nearly 500 rounds were fired during the study, resulting in a high cumulative blast exposure for all mortarmen. While two mortarmen had average BOPs exceeding the 4 psi safety limit, there was a high prevalence of mTBI-like symptoms among all mortarmen, with over 70% experiencing headaches, ringing in the ears, forgetfulness/poor memory, and taking longer to think during the training week (n≥8/11). Mortarmen also had smaller and slower pupillary light reflex responses relative to controls, with significantly slower dilation velocity (P<0.05) and constriction velocity (P<0.10). Conclusion: Mortarmen experienced high cumulative blast exposure coinciding with altered neurocognition that is suggestive of blast-related subconcussive injury. These neurocognitive effects occurred even in mortarmen with average BOP below the 4 psi safety threshold. While this study was limited by a small sample size, its results demonstrate a concerning health risk for mortarmen that requires additional study and immediate action. Behavioral changes like ducking and standing farther from the mortar when firing can generally help reduce mortarmen BOP exposure, but we recommend the establishment of daily cumulative safety thresholds and daily firing limits in training to reduce cumulative blast exposure, and ultimately, improve mortarmen's quality of life and longevity in service.

Surge Capacity in the COVID-19 Era: a Natural Experiment of Neurocritical Care in General Critical Care
Philips, S., Shi, Y., Coopersmith, C. M., Samuels, O. B., Pimentel-Farias, C., Mei, Y., Sadan, O., & Akbik, F. (2023). Neurocritical Care, 38(2), 320-325.
Abstract: Background: COVID-19 surges led to significant challenges in ensuring critical care capacity. In response, some centers leveraged neurocritical care (NCC) capacity as part of the surge response, with neurointensivists providing general critical care for patients with COVID-19 without neurologic illness. The relative outcomes of NCC critical care management of patients with COVID-19 remain unclear and may help guide further surge planning and provide broader insights into general critical care provided in NCC units. Methods: We performed an observational cohort study of all patients requiring critical care for COVID-19 across four hospitals within the Emory Healthcare system during the first three surges. Patients were categorized on the basis of admission to intensive care units (ICUs) staffed by general intensivists or neurointensivists. Patients with primary neurological diagnoses were excluded. Baseline demographics, clinical complications, and outcomes were compared between groups using univariable and propensity score matching statistics. Results: A total of 1141 patients with a primary diagnosis of COVID-19 required ICU admission. ICUs were staffed by general intensivists (n = 1071) or neurointensivists (n = 70). Baseline demographics and presentation characteristics were similar between groups, except for patients admitted to neurointensivist-staffed ICUs being younger (59 vs. 65, p = 0.027) and having a higher PaO2/FiO2 ratio (153 vs. 120, p = 0.002). After propensity score matching, there was no correlation between ICU staffing and the use of mechanical ventilation, renal replacement therapy, and vasopressors. The rates of in-hospital mortality and hospice disposition were similar in neurointensivist-staffed COVID-19 units (odds ratio 0.9, 95% confidence interval 0.31–2.64, p = 0.842). Conclusions: COVID-19 surges precipitated a natural experiment in which neurology-trained neurointensivists provided critical care in a comparable context to general intensivists treating the same disease. Neurology-trained neurointensivists delivered comparable outcomes to those of general ICUs during COVID-19 surges. These results further support the role of NCC in meeting general critical care needs of neurocritically ill patients and as a viable surge resource in general critical care.

Adaptive Partially Observed Sequential Change Detection and Isolation
Zhao, X., Hu, J., Mei, Y., & Yan, H. (2022). Technometrics, 64(4), 502-512.
Abstract: High-dimensional data has become prevalent due to the easy accessibility of sensors in modern industrial applications. However, one specific challenge is that it is often not easy to obtain complete measurements due to limited sensing powers and resource constraints. Furthermore, distinct failure patterns may exist in the systems, and it is necessary to identify the true failure pattern. This work focuses on the online adaptive monitoring of high-dimensional data in resource-constrained environments with multiple potential failure modes. To achieve this, we propose to apply the Shiryaev–Roberts procedure at the failure-mode level and use the multi-armed bandit to balance exploration and exploitation. We further discuss the theoretical properties of the proposed algorithm to show that the proposed method can correctly isolate the failure mode. Finally, extensive simulations and two case studies demonstrate that the change-point detection performance and the failure-mode isolation accuracy can be greatly improved.
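
For reference, the Shiryaev-Roberts statistic applied here at the failure-mode level admits a simple recursion; in its standard single-stream form, with Lambda_t the post- over pre-change likelihood ratio at time t,

```latex
R_t = (1 + R_{t-1})\,\Lambda_t, \qquad R_0 = 0,
\qquad\text{equivalently}\qquad
R_t = \sum_{k=1}^{t} \prod_{i=k}^{t} \Lambda_i,
```

and an alarm is raised the first time R_t crosses a prespecified threshold. The sum form shows that the statistic aggregates evidence over all possible change times at once.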

Asymptotic Theory of ℓ1-Regularized PDE Identification from a Single Noisy Trajectory
He, Y., Suh, N., Huo, X., Kang, S. H., & Mei, Y. (2022). SIAM/ASA Journal on Uncertainty Quantification, 10(3), 1012-1036.
Abstract: We provide a formal theoretical analysis of PDE identification via the ℓ1-regularized pseudo least squares method from the statistical point of view. In this article, we assume that the differential equation governing the dynamic system can be represented as a linear combination of various linear and nonlinear differential terms. Under noisy observations, we employ local-polynomial fitting for estimating state variables and apply the ℓ1 penalty for model selection. Our theory proves that the classical mutual incoherence condition on the feature matrix F and the β*_min-condition on the ground-truth signal β* are sufficient for the signed-support recovery of the ℓ1-PsLS method. We run numerical experiments on two popular PDE models, the viscous Burgers and the Korteweg-de Vries (KdV) equations, and the results from the experiments corroborate our theoretical predictions.
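
The identification pipeline can be imitated on a toy example: differentiate a noisy trajectory numerically, build a dictionary of candidate terms, and let an ℓ1 penalty select the active ones. The sketch below uses a heat equation u_t = 0.1 u_xx and raw finite differences instead of the paper's local-polynomial smoothing; grid sizes, noise level, and the lasso penalty are assumptions.

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(7)

# One noisy trajectory of the heat equation u_t = 0.1 * u_xx, whose exact
# solution for these initial data is a sum of two decaying Fourier modes.
x = np.linspace(0, 2 * np.pi, 128)
t = np.linspace(0, 2, 100)
X, T = np.meshgrid(x, t, indexing="ij")
u = (np.exp(-0.1 * T) * np.sin(X) + np.exp(-0.4 * T) * np.sin(2 * X)
     + rng.normal(0, 1e-4, X.shape))

# Finite-difference derivative estimates.
ut = np.gradient(u, t, axis=1)
ux = np.gradient(u, x, axis=0)
uxx = np.gradient(ux, x, axis=0)

# Dictionary of candidate PDE terms; only u_xx is truly active.
terms = {"u": u, "u_x": ux, "u_xx": uxx, "u*u_x": u * ux}
F = np.column_stack([v.ravel() for v in terms.values()])

fit = Lasso(alpha=1e-4, fit_intercept=False).fit(F, ut.ravel())
print(dict(zip(terms, np.round(fit.coef_, 3))))   # expect u_xx near 0.1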

Cannabis Use Is Not Associated with Aneurysmal Subarachnoid Hemorrhage Complications or Outcomes
Akbik, F., Konan, H. D., Williams, K. P., Ermias, L. M., Shi, Y., Takieddin, O., Grossberg, J. A., Howard, B. M., Tong, F., Cawley, C. M., Mei, Y., Samuels, O. B., & Sadan, O. (2022). Stroke, 53(8), E375-E376.

Does intrathecal nicardipine for cerebral vasospasm following subarachnoid hemorrhage correlate with reduced delayed cerebral ischemia? A retrospective propensity score-based analysis
Sadan, O., Waddel, H., Moore, R., Feng, C., Mei, Y., Pearce, D., Kraft, J., Pimentel, C., Mathew, S., Akbik, F., Ameli, P., Taylor, A., Danyluk, L., Martin, K. S., Garner, K., Kolenda, J., Pujari, A., Asbury, W., Jaja, B. N., … Samuels, O. (2022). Journal of Neurosurgery, 136(1), 115-124.
Abstract: OBJECTIVE Cerebral vasospasm and delayed cerebral ischemia (DCI) contribute to poor outcome following subarachnoid hemorrhage (SAH). With the paucity of effective treatments, the authors describe their experience with intrathecal (IT) nicardipine for this indication. METHODS Patients admitted to the Emory University Hospital neuroscience ICU between 2012 and 2017 with nontraumatic SAH, either aneurysmal or idiopathic, were included in the analysis. Using a propensity-score model, this patient cohort was compared to patients in the Subarachnoid Hemorrhage International Trialists (SAHIT) repository who did not receive IT nicardipine. The primary outcome was DCI. Secondary outcomes were long-term functional outcome and adverse events. RESULTS The analysis included 1351 patients, 422 of whom were diagnosed with cerebral vasospasm and treated with IT nicardipine. When compared with patients with no vasospasm (n = 859), the treated group was significantly younger (mean age 51.1 ± 12.4 years vs 56.7 ± 14.1 years, p < 0.001), had a higher World Federation of Neurosurgical Societies score and modified Fisher grade, and were more likely to undergo clipping of the ruptured aneurysm as compared to endovascular treatment (30.3% vs 11.3%, p < 0.001). Treatment with IT nicardipine decreased the daily mean transcranial Doppler velocities in 77.3% of the treated patients. When compared to patients not receiving IT nicardipine, treatment was not associated with an increased rate of bacterial ventriculitis (3.1% vs 2.7%, p > 0.1), yet higher rates of ventriculoperitoneal shunting were noted (19.9% vs 8.8%, p < 0.01). In a propensity score comparison to the SAHIT database, the odds ratio (OR) to develop DCI with IT nicardipine treatment was 0.61 (95% confidence interval [CI] 0.44-0.84), and the OR to have a favorable functional outcome (modified Rankin Scale score ≤ 2) was 2.17 (95% CI 1.61-2.91). CONCLUSIONS IT nicardipine was associated with improved outcome and reduced DCI compared with propensity-matched controls. There was an increased need for permanent CSF diversion but no other safety issues. These data should be considered when selecting medications and treatments to study in future randomized controlled clinical trials for SAH.

Predicting the rheology of limestone calcined clay cements (LC3): Linking composition and hydration kinetics to yield stress through Machine Learning
Canbek, O., Xu, Q., Mei, Y., Washburn, N. R., & Kurtis, K. E. (2022). Cement and Concrete Research, 160.
Abstract: The physicochemical characteristics of calcined clay influence the yield stress of limestone calcined clay cements (LC3), but the independent influences of the clay's physical and chemical characteristics, as well as the effect of other variables on LC3 rheology, are less well understood. Further, a relationship between LC3 hydration kinetics and yield stress – important for informing mixture design – has not yet been established. Here, rheological properties were determined in pastes with varying water-to-solid ratio (w/s), constituent mass ratios (PC:metakaolin:limestone), limestone particle size, and gypsum content. From these data, an ML model was developed that allowed independent examination of the different mechanisms by which metakaolin fraction influences the yield stress of LC3, identifying the four predictors – packing index, Al2O3/SO3, total particle density, and metakaolin fraction relative to limestone (MK/LS) – most significant for predicting LC3 yield stress. A methodology based on kernel smoothing also identified the hydration kinetics parameters best correlated with yield stress.

Rapid detection of hot-spots via tensor decomposition with applications to crime rate data
Zhao, Y., Yan, H., Holte, S., & Mei, Y. (2022). Journal of Applied Statistics, 49(7), 1636-1662.
Abstract: In many real-world applications of monitoring multivariate spatio-temporal data that are non-stationary over time, one is often interested in detecting hot-spots with spatial sparsity and temporal consistency, instead of detecting system-wise changes as in the traditional statistical process control (SPC) literature. In this paper, we propose an efficient method to detect hot-spots through tensor decomposition, and our method has three steps. First, we fit the observed data into a Smooth Sparse Decomposition Tensor (SSD-Tensor) model that serves as a dimension reduction and de-noising technique: it is an additive model decomposing the original data into a smooth but non-stationary global mean, sparse local anomalies, and random noises. Next, we estimate the model parameters by a penalized framework that includes the Least Absolute Shrinkage and Selection Operator (LASSO) and fused LASSO penalties. An efficient recursive optimization algorithm is developed based on the Fast Iterative Shrinkage Thresholding Algorithm (FISTA). Finally, we apply a Cumulative Sum (CUSUM) control chart to monitor model residuals after removing global means, which helps to detect when and where hot-spots occur. To demonstrate the usefulness of our proposed SSD-Tensor method, we compare it with several other methods including scan statistics, LASSO-based, PCA-based, and T2-based control charts in extensive numerical simulation studies and a real crime rate dataset.
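
The FISTA workhorse used for the penalized estimation step is short enough to sketch for a plain LASSO objective (the paper's objective adds a fused-LASSO term and tensor structure); the toy design matrix and penalty level are assumptions.

```python
import numpy as np

def fista_lasso(A, b, lam, n_iter=200):
    """FISTA for min_x 0.5*||Ax - b||^2 + lam*||x||_1."""
    L = np.linalg.norm(A, 2) ** 2          # Lipschitz constant of the gradient
    x = np.zeros(A.shape[1]); z = x.copy(); s = 1.0
    for _ in range(n_iter):
        g = z - A.T @ (A @ z - b) / L      # gradient step at the momentum point
        x_new = np.sign(g) * np.maximum(np.abs(g) - lam / L, 0.0)  # soft-threshold
        s_new = (1 + np.sqrt(1 + 4 * s * s)) / 2
        z = x_new + (s - 1) / s_new * (x_new - x)   # Nesterov-style momentum
        x, s = x_new, s_new
    return x

rng = np.random.default_rng(8)
A = rng.normal(size=(100, 50))
truth = np.zeros(50); truth[[3, 27]] = [2.0, -1.5]   # two sparse "anomalies"
b = A @ truth + rng.normal(0, 0.1, 100)
print(np.nonzero(np.abs(fista_lasso(A, b, lam=5.0)) > 0.1)[0])  # expect [3 27]
```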

Robust change detection for large-scale data streams
Zhang, R., Mei, Y., & Shi, J. (2022). Sequential Analysis, 41(1), 1-19.
Abstract: Robust change-point detection for large-scale data streams has many real-world applications in industrial quality control, signal detection, and biosurveillance. Unfortunately, it is highly nontrivial to develop efficient schemes due to three challenges: (1) the unknown sparse subset of affected data streams, (2) the unexpected outliers, and (3) computational scalability for real-time monitoring and detection. In this article, we develop a family of efficient real-time robust detection schemes for monitoring large-scale independent data streams. For each data stream, we propose to construct a new local robust CUSUM (cumulative sum) detection statistic that can reduce the effect of outliers by using the Box-Cox transformation of the likelihood function. Then the global scheme will raise an alarm based upon the sum of the shrinkage transformation of these local robust CUSUM statistics to filter out unaffected data streams. In addition, we propose a new concept called the false alarm breakdown point to measure the robustness of online monitoring schemes and propose a worst-case detection efficiency score to measure the detection efficiency when the data contain outliers. We then characterize the breakdown point and the efficiency score of our proposed schemes. Asymptotic analysis and numerical simulations are conducted to illustrate the robustness and efficiency of our proposed schemes.

Robustness and Tractability for Nonconvex M-Estimators
Zhang, R., Mei, Y., Shi, J., & Xu, H. (2022). Statistica Sinica, 32(3), 1295-1316.
Abstract: We investigate two important properties of M-estimators, namely, robustness and tractability, in a linear regression setting, when the observations are contaminated by some arbitrary outliers. Specifically, robustness is the statistical property that the estimator should always be close to the true underlying parameters, regardless of the distribution of the outliers, and tractability refers to the computational property that the estimator can be computed efficiently, even if the objective function of the M-estimator is nonconvex. In this article, by examining the empirical risk, we show that under some sufficient conditions, many M-estimators enjoy nice robustness and tractability properties simultaneously when the percentage of outliers is small. We extend our analysis to the high-dimensional setting, where the number of parameters is greater than the number of samples, p ≫ n, and prove that when the proportion of outliers is small, the penalized M-estimators with the L1 penalty enjoy robustness and tractability simultaneously. Our research provides an analytic approach to determine the effects of outliers and tuning parameters on the robustness and tractability of some families of M-estimators. Simulations and case studies are presented to illustrate the usefulness of our theoretical results for M-estimators under Welsch's exponential squared loss and Tukey's bisquare loss.
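
A standard way to compute such nonconvex M-estimators is iteratively reweighted least squares; the sketch below does this for Tukey's bisquare loss with a MAD scale estimate. It illustrates the estimator class the paper studies, not the paper's theory; the tuning constant is the conventional 4.685 and the data are made-up.

```python
import numpy as np

def tukey_weight(r, c=4.685):
    """Tukey bisquare weights: residuals with |r| >= c get zero weight."""
    w = (1 - (r / c) ** 2) ** 2
    w[np.abs(r) >= c] = 0.0
    return w

def m_estimate(X, y, n_iter=50):
    """Iteratively reweighted least squares for Tukey's bisquare loss."""
    beta = np.linalg.lstsq(X, y, rcond=None)[0]     # least-squares start
    for _ in range(n_iter):
        r = y - X @ beta
        scale = 1.4826 * np.median(np.abs(r - np.median(r)))  # robust MAD scale
        w = tukey_weight(r / scale)
        W = w[:, None] * X
        beta = np.linalg.solve(X.T @ W, W.T @ y)    # weighted normal equations
    return beta

rng = np.random.default_rng(9)
X = np.column_stack([np.ones(200), rng.normal(size=200)])
y = X @ np.array([1.0, 2.0]) + rng.normal(0, 0.5, 200)
y[:20] += 15.0                                      # 10% gross outliers
print(m_estimate(X, y))                             # close to [1, 2]
```

IRLS itself does not guarantee a global optimum of the nonconvex objective; the paper's point is precisely to characterize when such estimators remain both robust and tractable.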

Treatment Effect Modeling for FTIR Signals Subject to Multiple Sources of Uncertainties
Tian, H., Wang, A., Chen, J., Jiang, X., Shi, J., Zhang, C., Mei, Y., & Wang, B. (2022). IEEE Transactions on Automation Science and Engineering, 19(2), 895-906.
Abstract: Fourier-transform infrared spectroscopy (FTIR) is a widely adopted technique for characterizing the chemical composition in many physical and chemical analyses. However, FTIR spectra are subject to multiple sources of uncertainty, and thus their analysis relies on domain experts and can only lead to qualitative conclusions. This study aims to analyze the effect of a certain treatment on FTIR spectra subject to two commonly observed uncertainties, the offset shift and the multiplicative error. Due to these uncertainties, the pre-exposure FTIR spectra are modeled according to the physical understanding of the uncertainty - observed spectra can be viewed as translating and stretching an underlying template signal - and the post-exposure FTIR spectra are modeled as the translated and stretched template signal plus an extra functional treatment effect. To provide engineering interpretation, the treatment effect is modeled as the product of the pattern of modification and its corresponding magnitude. A two-step parameter estimation algorithm is developed to estimate the underlying template signal, the pattern of modification, and the magnitude of modification at various treatment strengths. The effectiveness of the proposed method is validated in a simulation study. Further, in a real case study, the proposed method is used to investigate the effect of plasma exposure on FTIR spectra. As a result, the proposed method effectively identifies the pattern of modification under uncertainties in the manufacturing environment, which matches the knowledge of the chemical components affected by the plasma treatment, and the recovered magnitude of modification provides guidance in selecting the control parameter of the plasma treatment. Note to Practitioners: An FTIR spectrometer is often used to characterize the surface chemical composition of a material. Due to the large uncertainties associated with the nature of the spectrometer and the measurement environment, FTIR signals are usually examined visually by experienced engineers and technicians in industrial applications, which can be both time-consuming and inaccurate. To understand the effect of plasma exposure on the surface property of carbon fiber reinforced polymer (CFRP) material, the elimination of uncertainties associated with FTIR signals is investigated, and a systematic method is proposed to quantify the effect of surface treatments on FTIR signals. A two-step analytic procedure is proposed, which provides information on how the plasma exposure distorts the FTIR signals and how the plasma distance relates to the magnitude of the distortion. The methodology in this article can be used to analyze treatment effects on a variety of spectroscopic measurements that are subject to uncertainties such as offset and scaling errors, which expands the applications of in situ handheld spectrometer metrology in manufacturing industries.
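
A stripped-down version of the offset-plus-multiplicative-error model helps fix ideas: simulate spectra as a + b*(template + m*pattern) + noise, regress out the offset and template, then project the residual onto the pattern to recover roughly b times the magnitude m. The template, pattern, and noise levels are synthetic stand-ins, and the paper estimates the pattern rather than assuming it known.

```python
import numpy as np

rng = np.random.default_rng(10)

grid = np.linspace(0, 1, 400)                       # normalized wavenumber axis
template = np.exp(-((grid - 0.3) / 0.05) ** 2)      # baseline peak (synthetic)
pattern = np.exp(-((grid - 0.7) / 0.04) ** 2)       # modification pattern

def observe(m):
    """Spectrum with offset a, multiplicative error b, and effect size m."""
    a, b = rng.normal(0, 0.05), rng.normal(1.0, 0.1)
    return a + b * (template + m * pattern) + rng.normal(0, 0.01, grid.size)

def estimate_magnitude(y):
    """Regress out offset and template, then project the residual onto the
    pattern; recovers roughly b*m (the multiplicative error is absorbed)."""
    Z = np.column_stack([np.ones_like(grid), template])
    resid = y - Z @ np.linalg.lstsq(Z, y, rcond=None)[0]
    return resid @ pattern / (pattern @ pattern)

print([round(estimate_magnitude(observe(m)), 3) for m in (0.0, 0.2, 0.5)])
```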

A boosting inspired personalized threshold method for sepsis screening
Feng, C., Griffin, P., Kethireddy, S., & Mei, Y. (2021). Journal of Applied Statistics, 48(1), 154-175.
Abstract: Sepsis is one of the biggest risks to patient safety, with a natural mortality rate between 25% and 50%. It is difficult to diagnose, and no validated standard for diagnosis currently exists. A commonly used scoring criterion is the quick sequential organ failure assessment (qSOFA), which, however, demonstrates very low specificity in ICU populations. We develop a method to personalize thresholds in qSOFA that incorporates easy-to-measure patient baseline characteristics. We compare the personalized threshold method to qSOFA, five previously published methods that obtain an optimal constant threshold for a single biomarker, and machine learning algorithms based on logistic regression and AdaBoost, using patient data in the MIMIC-III database. The personalized threshold method achieves higher accuracy than qSOFA and the five published methods and has comparable performance to the machine learning methods. Personalized thresholds, however, are much easier to adopt in real-life monitoring than machine learning methods, as they are computed once for a patient and used in the same way as qSOFA, whereas the machine learning methods are hard to implement and interpret.
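
The notion of a personalized threshold can be illustrated with a plain logistic model (not the authors' boosting-inspired construction): once a model weights a monitored vital sign together with baseline covariates, its decision boundary is exactly a patient-specific cutoff on the vital sign. The data and coefficients below are synthetic, not MIMIC-III.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(11)

# Synthetic screening data: one monitored vital sign plus two baseline
# characteristics (say, age and a comorbidity score); illustration only.
n = 2000
base = rng.normal(size=(n, 2))
vital = rng.normal(20, 3, n) + base @ np.array([1.5, -1.0])
logit = 0.8 * (vital - 20) - 1.2 * base[:, 0]       # true log-odds of sepsis
y = rng.binomial(1, 1 / (1 + np.exp(-logit)))

clf = LogisticRegression().fit(np.column_stack([vital, base]), y)
w_v = clf.coef_[0][0]                               # weight on the vital sign

# The boundary w_v*vital + w_b.base + c = 0 is a patient-specific cutoff:
def personal_threshold(b):
    return -(clf.intercept_[0] + b @ clf.coef_[0][1:]) / w_v

patient = np.array([1.0, 0.5])                      # older, some comorbidity
print(f"flag this patient when the vital sign exceeds "
      f"{personal_threshold(patient):.1f}")
```

The cutoff is computed once per patient and then used exactly like a constant qSOFA threshold, which is the practical advantage the abstract emphasizes.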

Aneurysmal Subarachnoid Hemorrhage: Trends, Outcomes, and Predictions from a 15-Year Perspective of a Single Neurocritical Care Unit
Samuels, O. B., Sadan, O., Feng, C., Martin, K., Medani, K., Mei, Y., & Barrow, D. L. (2021). Neurosurgery, 88(3), 574-583.
Abstract: BACKGROUND: Aneurysmal subarachnoid hemorrhage (aSAH) is associated with disproportionally high mortality and long-term neurological sequelae. Management of patients with aSAH has changed markedly over the years, leading to improvements in outcome. OBJECTIVE: To describe trends in aSAH care and outcome in a high-volume single-center 15-year cohort. METHODS: All new admissions diagnosed with subarachnoid hemorrhage (SAH) to our tertiary neuro-intensive care unit between 2002 and 2016 were reviewed. Trend analysis was performed to assess temporal changes, and a step-wise regression analysis was done to identify factors associated with outcomes. RESULTS: Out of 3970 admissions of patients with SAH, 2475 patients proved to have a ruptured intracranial aneurysm. Over the years of the study, patient acuity increased by Hunt & Hess (H&H) grade and related complications. Endovascular therapies became more prevalent over the years and were correlated with better outcome. Functional outcome overall improved, yet the main effect was noted in the low- and intermediate-grade patients. Several parameters were associated with poor functional outcome, including long-term mechanical ventilation (odds ratio 11.99, CI 95% [7.15-20.63]), acute kidney injury (3.55 [1.64-8.24]), pneumonia (2.89 [1.89-4.42]), hydrocephalus (1.80 [1.24-2.63]), diabetes mellitus (1.71 [1.04-2.84]), seizures (1.69 [1.07-2.70]), H&H grade (1.67 [1.45-1.94]), and age (1.06 [1.05-1.07]), while an endovascular approach to treat the aneurysm, compared with clip-ligation, had a positive effect (0.35 [0.25-0.48]). CONCLUSION: This large, single referral center, retrospective analysis reveals important trends in the treatment of aSAH. It also demonstrates that despite improvement in functional outcome over the years, systemic complications remain a significant risk factor for poor prognosis. The historic H&H determination of outcome is less valid with today's improved care.

Correlation-based dynamic sampling for online high dimensional process monitoring
Nabhan, M., Mei, Y., & Shi, J. (2021). Journal of Quality Technology, 53(3), 289-308.
Abstract: Effective process monitoring of high-dimensional data streams with embedded spatial structures has been an emerging challenge for environments with limited resources. Utilizing the spatial structure is key to improving monitoring performance. This article proposes a correlation-based dynamic sampling technique for change detection. Our method borrows the idea of the Upper Confidence Bound (UCB) algorithm and uses the correlation structure not only to calculate a global statistic, but also to infer unobserved sensors from partial observations. Simulation studies and two case studies on solar flare detection and carbon nanotube (CNT) buckypaper process monitoring are used to validate the effectiveness of our method.

Creation of a Pediatric Choledocholithiasis Prediction Model
Cohen, R. Z., Tian, H., Sauer, C. G., Willingham, F. F., Santore, M. T., Mei, Y., & Freeman, A. J. (2021). Journal of Pediatric Gastroenterology and Nutrition, 73(5), 636-641.
Abstract: Background: Definitive non-invasive detection of pediatric choledocholithiasis could allow more efficient identification of those patients who are most likely to benefit from therapeutic endoscopic retrograde cholangiopancreatography (ERCP) for stone extraction. Objective: To craft a pediatric choledocholithiasis prediction model using a combination of commonly available serum laboratory values and ultrasound results. Methods: A retrospective review of laboratory and imaging results from 316 pediatric patients who underwent intraoperative cholangiogram or ERCP due to suspicion of choledocholithiasis was performed, and results were compared to the presence of common bile duct stones on cholangiography. Multivariate logistic regression with supervised machine learning was used to create a predictive scoring model. Monte-Carlo cross-validation was used to validate the scoring model, and a score threshold that would provide at least 90% specificity for choledocholithiasis was determined in an effort to minimize non-therapeutic ERCP. Results: Alanine aminotransferase (ALT), total bilirubin, alkaline phosphatase, and common bile duct diameter via ultrasound were found to be the key clinical variables in determining the likelihood of choledocholithiasis. The dictated specificity threshold of 90.3% yielded a sensitivity of 40.8% and overall accuracy of 71.5% in detecting choledocholithiasis. Positive predictive value was 71.4% and negative predictive value was 72.1%. Conclusion: Our novel pediatric choledocholithiasis predictive model is a highly specific tool to suggest ERCP in the setting of likely choledocholithiasis.
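
The scoring-plus-specificity-threshold recipe can be sketched as follows: fit a logistic model on the four predictors and pick the score cutoff as the appropriate quantile of scores among stone-free patients, so that specificity is at least 90% at the cost of sensitivity. The data, log transforms, and coefficients are synthetic assumptions, not the published model.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(12)

# Synthetic stand-ins for the four predictors named in the abstract
# (ALT, total bilirubin, alkaline phosphatase, CBD diameter); the data
# and coefficients are invented, not the published model.
n = 316
X = rng.lognormal(mean=[3.5, 0.0, 5.0, 1.0], sigma=0.6, size=(n, 4))
logit = np.log(X) @ np.array([0.8, 1.2, 0.3, 0.9]) - 5.5
y = rng.binomial(1, 1 / (1 + np.exp(-logit)))

model = LogisticRegression().fit(np.log(X), y)
scores = model.predict_proba(np.log(X))[:, 1]

# Pick the cutoff as the 90th percentile of scores among stone-free patients,
# so specificity is ~90%, trading sensitivity to avoid non-therapeutic ERCP.
cutoff = np.quantile(scores[y == 0], 0.90)
pred = scores >= cutoff
print(f"sensitivity={pred[y == 1].mean():.2f}, "
      f"specificity={(~pred[y == 0]).mean():.2f}")
```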