{"title":"Homogeneity Tests for High-dimensional Mean Vectors and Covariance Matrices","authors":"Wenwen Guo, Xinyuan Song, H. Cui","doi":"10.5705/ss.202022.0048","DOIUrl":"https://doi.org/10.5705/ss.202022.0048","url":null,"abstract":"Homogeneity Tests","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70938322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SEMIPARAMETRIC REVERSED MEAN MODEL FOR RECURRENT EVENT PROCESS WITH INFORMATIVE TERMINAL EVENT.","authors":"Wen Su, Li Liu, Guosheng Yin, Xingqiu Zhao, Ying Zhang","doi":"10.5705/ss.202021.0353","DOIUrl":"10.5705/ss.202021.0353","url":null,"abstract":"<p><p>We study semiparametric regression for a recurrent event process with an informative terminal event, where observations are taken only at discrete time points, rather than continuously over time. To account for the effect of a terminal event on the recurrent event process, we propose a semiparametric reversed mean model, for which we develop a two-stage sieve likelihood-based method to estimate the baseline mean function and the covariate effects. Our approach overcomes the computational difficulties arising from the nuisance functional parameter in the assumption that the likelihood is based on a Poisson process. We establish the consistency, convergence rate, and asymptotic normality of the proposed two-stage estimator, which is robust against the assumption of an underlying Poisson process. The proposed method is evaluated using extensive simulation studies, and demonstrated using panel count data from a longitudinal healthy longevity study and data from a bladder tumor study.</p>","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":"1843-1862"},"PeriodicalIF":1.2,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12291165/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70937462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Outlier Detection via a Minimum Ridge Covariance Determinant Estimator","authors":"Chikun Li, B. Jin, Yuehua Wu","doi":"10.5705/ss.202022.0142","DOIUrl":"https://doi.org/10.5705/ss.202022.0142","url":null,"abstract":"In this paper, we propose an outlier detection procedure based on a high-breakdown minimum ridge covariance determinant estimator that is especially useful in the large p/n scenario. The estimator is obtained from the subset of observations, after excluding potential outliers, by applying so-called concentration steps. We derive the asymptotic distribution of the modified Mahalanobis distance associated with the proposed estimator under certain moment conditions, and obtain a theoretical cutoff value for outlier identification. We also improve the outlier detection power by adding a one-step reweighting procedure. Lastly, we investigate the performance of the proposed methods using simulations and a real-data analysis.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70938675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
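The ridge-regularized Mahalanobis distance at the core of such a procedure can be sketched in a few lines. This is a minimal illustration only: it omits the concentration steps, the high-breakdown subset search, and the one-step reweighting described in the abstract, and the function name `ridge_mahalanobis` and the choice `alpha=0.1` are assumptions of this sketch, not the authors' estimator.

```python
import numpy as np

def ridge_mahalanobis(X, alpha=0.1):
    """Squared Mahalanobis distances computed from a ridge-regularized
    covariance estimate S + alpha*I, which stays invertible even when
    the dimension p is close to the sample size n."""
    X = np.asarray(X, dtype=float)
    diff = X - X.mean(axis=0)
    cov = np.cov(X, rowvar=False) + alpha * np.eye(X.shape[1])
    # quadratic form diff_i^T cov^{-1} diff_i for every row i
    return np.einsum("ij,jk,ik->i", diff, np.linalg.inv(cov), diff)

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 10))
X[0] += 6.0                   # plant one shifted outlier
d2 = ridge_mahalanobis(X)     # row 0 receives a conspicuously large distance
```

In the paper's procedure these distances would be recomputed on outlier-free subsets and compared against a theoretical cutoff; here they only illustrate the regularized distance itself.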
{"title":"On the Efficiency of Composite Likelihood Estimation for Gaussian Spatial Processes","authors":"N. Chua, Francis K. C. Hui, A. Welsh","doi":"10.5705/ss.202020.0311","DOIUrl":"https://doi.org/10.5705/ss.202020.0311","url":null,"abstract":"the Efficiency of Composite Likelihood","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70936712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive Randomization via Mahalanobis Distance","authors":"Yichen Qin, Y. Li, Wei Ma, Haoyu Yang, F. Hu","doi":"10.5705/ss.202020.0440","DOIUrl":"https://doi.org/10.5705/ss.202020.0440","url":null,"abstract":"In comparative studies, researchers often seek an optimal covariate balance. However, chance imbalance still exists in randomized experiments, and becomes more serious as the number of covariates increases. To address this issue, we introduce a new randomization procedure, called adaptive randomization via the Mahalanobis distance (ARM). The proposed method allocates units sequentially and adaptively, using information on the current level of imbalance and the incoming unit’s covariates. Theoretical results and numerical comparisons show that with a large number of covariates or a large number of units, the proposed method shows substantial advantages over traditional methods in terms of covariate balance, estimation accuracy, hypothesis testing power, and computational time. The proposed method attains the optimal covariate balance, in the sense that the estimated treatment effect attains its minimum variance asymptotically, and can be applied in both causal inference and clinical trials.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70936861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
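The covariate-imbalance criterion that drives ARM-type allocation can be written down directly. This is a sketch, not the authors' procedure: the paper's ARM assigns units sequentially with a biased coin, whereas the snippet below only evaluates the Mahalanobis imbalance of a completed assignment, and the function name is illustrative.

```python
import numpy as np

def mahalanobis_imbalance(X, assign):
    """Mahalanobis distance between the covariate means of the two
    arms -- the imbalance measure that ARM-type rules try to keep
    small (illustrative implementation, not the paper's code)."""
    X = np.asarray(X, dtype=float)
    treat, ctrl = X[assign == 1], X[assign == 0]
    diff = treat.mean(axis=0) - ctrl.mean(axis=0)
    cov = np.cov(X, rowvar=False)
    n1, n0 = len(treat), len(ctrl)
    # scaled quadratic form (n1*n0/n) * diff^T cov^{-1} diff
    return (n1 * n0 / (n1 + n0)) * diff @ np.linalg.solve(cov, diff)

# two identical cohorts split across the arms are perfectly balanced
rng = np.random.default_rng(0)
Z = rng.normal(size=(20, 4))
X = np.vstack([Z, Z])
assign = np.r_[np.ones(20, int), np.zeros(20, int)]
balanced = mahalanobis_imbalance(X, assign)   # essentially zero
```

A sequential rule would recompute this quantity for each hypothetical assignment of the incoming unit and favor the arm that keeps it smaller.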
{"title":"Regression Analysis of Spatially Correlated Event Durations With Missing Origins Annotated by Longitudinal Measures","authors":"Y. Xiong, W. J. Braun, T. Duchesne, X. J. Hu","doi":"10.5705/ss.202021.0118","DOIUrl":"https://doi.org/10.5705/ss.202021.0118","url":null,"abstract":"This paper is concerned with event durations in situations where the study units may be spatially correlated and the time origins of the events are missing. We develop regression models based on the partly observed durations with the aid of available longitudinal information. The first-hitting-time model (e.g. Lee and Whitmore, 2006) is employed to link the data of event durations and the associated longitudinal measures with shared random effects. We present procedures for estimating the model parameters and an induced estimator of the conditional distribution of the event duration. We apply the EM algorithm and Monte Carlo methods to compute the proposed estimators. We establish consistency and asymptotic normality of the estimators, and present their variance estimation. The proposed approach is illustrated with a collection of wildfire records from Alberta, Canada. Its performance is examined numerically and compared with two competitors via simulation.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"24 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70937179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Simultaneous Functional Quantile Regression","authors":"Boyi Hu, Xixi Hu, Hua Liu, Jinhong You, Jiguo Cao","doi":"10.5705/ss.202021.0248","DOIUrl":"https://doi.org/10.5705/ss.202021.0248","url":null,"abstract":"The conventional method for functional quantile regression (FQR) is to fit the regression model for each quantile of interest separately. Therefore, the slope function of the regression, as a bivariate function of time and quantile, is estimated as a univariate function of time for each fixed quantile. However, there are several limitations to this conventional strategy. For example, it cannot guarantee the monotonicity of the conditional quantiles, nor can it control the smoothness of the slope estimator as a bivariate function. In this paper, we propose a new framework for FQR, in which we simultaneously fit the FQR model for multiple quantiles, with the help of a bivariate basis under some constraints, such that the estimated quantiles satisfy the monotonicity conditions and the smoothness of the slope estimator is controlled. The proposed estimator for the slope function is shown to be asymptotically consistent, and we establish its asymptotic normality. We use simulation to evaluate the finite-sample performance of the proposed method and compare it with that of the conventional method.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70937519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Generalized Odds Rate Frailty Models for Current Status Data with Informative Censoring","authors":"Yang Xu, Shishun Zhao, T. Hu, Jianguo Sun","doi":"10.5705/ss.202021.0411","DOIUrl":"https://doi.org/10.5705/ss.202021.0411","url":null,"abstract":"Current-status data occur in many areas, and their analysis has attracted much attention. In this study, we consider a regression analysis of current-status data in the presence of informative censoring, for which most existing methods either apply only to limited situations or are computationally unstable. Here, we propose a new sieve maximum likelihood estimation procedure under the class of semiparametric generalized odds rate frailty models. The proposed method uses a latent variable to describe the informative censoring, that is, the relationship between the failure time of interest and the censoring time. We develop a novel expectation-maximization algorithm for determining the proposed estimators, and establish their asymptotic consistency and normality. The results of a simulation study show that the proposed method performs well in practice.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70938072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Statistical Inference for Mean Function of Longitudinal Imaging Data over Complicated Domains","authors":"Qirui Hu, Jie Li","doi":"10.5705/ss.202021.0415","DOIUrl":"https://doi.org/10.5705/ss.202021.0415","url":null,"abstract":"We propose a novel procedure for estimating the mean function of longitudinal imaging data with inherent spatial and temporal correlation. We depict the dependence between temporally ordered images using a functional moving average, and use flexible bivariate splines over triangulations to handle the irregular domain of images which is common in imaging studies. We establish both the global and the local asymptotic properties of the bivariate spline estimator for the mean function, with simultaneous confidence corridors (SCCs) as a theoretical byproduct. Under some mild conditions, the proposed estimator and its accompanying SCCs are shown to be consistent and oracle efficient, as though all images were entirely observed without errors. We use Monte Carlo simulation experiments to demonstrate the finite-sample performance of the proposed method, the results of which strongly corroborate the asymptotic theory. The proposed method is further illustrated by analyzing two seawater potential temperature data sets.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70938100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Group Testing Regression Analysis with Missing Data and Imperfect Tests","authors":"A. Delaigle, Ruoxu Tan","doi":"10.5705/ss.202021.0382","DOIUrl":"https://doi.org/10.5705/ss.202021.0382","url":null,"abstract":"Estimating the prevalence of an infectious disease in a large population typically requires testing a specimen (e.g., blood, urine, or swab) for the disease. When the disease spreads quickly, time constraints and limited resources often restrict the number of tests that can be performed. In such cases, if the prevalence is not too high, the group testing procedure can be employed to save time, money, and resources. The procedure tests pooled specimens of groups of individuals, rather than testing each individual for the disease. This technique is also used in other contexts, for example, to detect abnormalities or contamination in animals, plants, food, or water. Although methods exist for estimating prevalence conditional on explanatory variables from group testing data, they require the specimen to be available for all individuals, which is not always possible. Therefore, we construct new nonparametric estimators that are consistent when some of the specimens are missing. We demonstrate the numerical performance of our methods using simulations and a hepatitis B example.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70938201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
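For intuition on why pooling works, the classical prevalence estimator for perfect pooled tests follows from P(pool positive) = 1 - (1 - p)^k for pools of size k. This is the textbook estimator under perfect tests, complete specimens, and no covariates, so it is not the paper's nonparametric regression estimator; the function name is illustrative.

```python
def pooled_prevalence(n_pools, n_positive, pool_size):
    """MLE of individual prevalence p from perfect group tests:
    a pool of size k is positive iff at least one member is infected,
    so the pool-level positive rate is 1 - (1 - p)^k; invert that
    relation at the observed rate."""
    pool_rate = n_positive / n_pools
    return 1.0 - (1.0 - pool_rate) ** (1.0 / pool_size)

# 100 pools of 5 specimens each, 10 pools test positive
p_hat = pooled_prevalence(100, 10, 5)   # about 0.021
```

With 10% of pools positive, only 100 tests bound the prevalence of 500 individuals at roughly 2%, which is the resource saving the abstract describes; handling imperfect tests and missing specimens is what the paper's estimators add.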