Expectation Maximization Algorithm

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 30933 Experts worldwide ranked by ideXlab platform

Omer Ozturk - One of the best experts on this subject based on the ideXlab platform.

  • mixture model analysis of partially rank ordered set samples age groups of fish from length frequency data
    Scandinavian Journal of Statistics, 2015
    Co-Authors: Armin Hatefi, Mohammad Jafari Jozani, Omer Ozturk
    Abstract:

    type="main" xml:id="sjos12140-abs-0001"> We present a novel methodology for estimating the parameters of a finite mixture model (FMM) based on partially rank-ordered set (PROS) sampling and use it in a fishery application. A PROS sampling design first selects a simple random sample of fish and creates partially rank-ordered judgement subsets by dividing units into subsets of prespecified sizes. The final measurements are then obtained from these partially ordered judgement subsets. The traditional ExpectationMaximization Algorithm is not directly applicable for these observations. We propose a suitable ExpectationMaximization Algorithm to estimate the parameters of the FMMs based on PROS samples. We also study the problem of classification of the PROS sample into the components of the FMM. We show that the maximum likelihood estimators based on PROS samples perform substantially better than their simple random sample counterparts even with small samples. The results are used to classify a fish population using the length-frequency data.

  • mixture model analysis of partially rank ordered set samples age groups of fish from length frequency data
    Scandinavian Journal of Statistics, 2015
    Co-Authors: Armin Hatefi, Mohammad Jafari Jozani, Omer Ozturk
    Abstract:

    We present a novel methodology for estimating the parameters of a finite mixture model (FMM) based on partially rank-ordered set (PROS) sampling and use it in a fishery application. A PROS sampling design first selects a simple random sample of fish and creates partially rank-ordered judgement subsets by dividing units into subsets of prespecified sizes. The final measurements are then obtained from these partially ordered judgement subsets. The traditional ExpectationMaximization Algorithm is not directly applicable for these observations. We propose a suitable ExpectationMaximization Algorithm to estimate the parameters of the FMMs based on PROS samples. We also study the problem of classification of the PROS sample into the components of the FMM. We show that the maximum likelihood estimators based on PROS samples perform substantially better than their simple random sample counterparts even with small samples. The results are used to classify a fish population using the length-frequency data.

Xiangyu Guo - One of the best experts on this subject based on the ideXlab platform.

  • robust high dimensional Expectation Maximization Algorithm via trimmed hard thresholding
    Machine Learning, 2020
    Co-Authors: Di Wang, Xiangyu Guo
    Abstract:

    In this paper, we study the problem of estimating latent variable models with arbitrarily corrupted samples in high dimensional space (i.e., $$d\gg n$$ ) where the underlying parameter is assumed to be sparse. Specifically, we propose a method called Trimmed (Gradient) Expectation Maximization which adds a trimming gradients step and a hard thresholding step to the Expectation step (E-step) and the Maximization step (M-step), respectively. We show that under some mild assumptions and with an appropriate initialization, the Algorithm is corruption-proofing and converges to the (near) optimal statistical rate geometrically when the fraction of the corrupted samples $$\epsilon$$ is bounded by $${\tilde{O}}\bigg (\frac{1}{\sqrt{n}}\bigg )$$ . Moreover, we apply our general framework to three canonical models: mixture of Gaussians, mixture of regressions and linear regression with missing covariates. Our theory is supported by thorough numerical results.

  • robust high dimensional Expectation Maximization Algorithm via trimmed hard thresholding
    arXiv: Machine Learning, 2020
    Co-Authors: Di Wang, Xiangyu Guo
    Abstract:

    In this paper, we study the problem of estimating latent variable models with arbitrarily corrupted samples in high dimensional space ({\em i.e.,} $d\gg n$) where the underlying parameter is assumed to be sparse. Specifically, we propose a method called Trimmed (Gradient) Expectation Maximization which adds a trimming gradients step and a hard thresholding step to the Expectation step (E-step) and the Maximization step (M-step), respectively. We show that under some mild assumptions and with an appropriate initialization, the Algorithm is corruption-proofing and converges to the (near) optimal statistical rate geometrically when the fraction of the corrupted samples $\epsilon$ is bounded by $ \tilde{O}(\frac{1}{\sqrt{n}})$. Moreover, we apply our general framework to three canonical models: mixture of Gaussians, mixture of regressions and linear regression with missing covariates. Our theory is supported by thorough numerical results.

Pedro Vilanova - One of the best experts on this subject based on the ideXlab platform.

  • An Efficient Forward-Reverse Expectation-Maximization Algorithm for Statistical Inference in Stochastic Reaction Networks
    Stochastic Analysis and Applications, 2016
    Co-Authors: Christian Bayer, Alvaro Moraes, Raul Tempone, Pedro Vilanova
    Abstract:

    ABSTRACTIn this work, we present an extension of the forward–reverse representation introduced by Bayer and Schoenmakers (Annals of Applied Probability, 24(5):1994–2032, 2014) to the context of stochastic reaction networks (SRNs). We apply this stochastic representation to the computation of efficient approximations of expected values of functionals of SRN bridges, that is, SRNs conditional on their values in the extremes of given time intervals. We then employ this SRN bridge-generation technique to the statistical inference problem of approximating reaction propensities based on discretely observed data. To this end, we introduce a two-phase iterative inference method in which, during phase I, we solve a set of deterministic optimization problems where the SRNs are replaced by their reaction-rate ordinary differential equations approximation; then, during phase II, we apply the Monte Carlo version of the Expectation-Maximization Algorithm to the phase I output. By selecting a set of overdispersed seeds ...

  • an efficient forward reverse Expectation Maximization Algorithm for statistical inference in stochastic reaction networks
    arXiv: Numerical Analysis, 2015
    Co-Authors: Christian Bayer, Alvaro Moraes, Raul Tempone, Pedro Vilanova
    Abstract:

    In this work, we present an extension to the context of Stochastic Reaction Networks (SRNs) of the forward-reverse representation introduced in "Simulation of forward-reverse stochastic representations for conditional diffusions", a 2014 paper by Bayer and Schoenmakers. We apply this stochastic representation in the computation of efficient approximations of expected values of functionals of SNR bridges, i.e., SRNs conditioned to its values in the extremes of given time-intervals. We then employ this SNR bridge-generation technique to the statistical inference problem of approximating the reaction propensities based on discretely observed data. To this end, we introduce a two-phase iterative inference method in which, during phase I, we solve a set of deterministic optimization problems where the SRNs are replaced by their reaction-rate Ordinary Differential Equations (ODEs) approximation; then, during phase II, we apply the Monte Carlo version of the Expectation-Maximization (EM) Algorithm starting from the phase I output. By selecting a set of over dispersed seeds as initial points for phase I, the output of parallel runs from our two-phase method is a cluster of approximate maximum likelihood estimates. Our results are illustrated by numerical examples.

Roberto Horowitz - One of the best experts on this subject based on the ideXlab platform.

  • Prediction on Travel-Time Distribution for Freeways Using Online Expectation Maximization Algorithm
    TRB, 2014
    Co-Authors: Nianfeng Wan, Gabriel Gomes, Roberto Horowitz
    Abstract:

    This paper presents a stochastic model-based approach to freeway travel-time prediction. The approach uses the Link-Node Cell Transmission Model (LN-CTM) to model traffic and provides a probability distribution for travel time. On-ramp and mainline flow profiles are collected from loop detectors, along with their uncertainties. The probability distribution is generated using Monte Carlo simulation and the Online Expectation Maximization clustering Algorithm. The simulation is implemented with a reasonable stopping criterion in order to reduce sample size requirement. Results show that the approach is able to generate an accurate multimodal distribution for travel-time. Future improvements are also discussed.

Armin Hatefi - One of the best experts on this subject based on the ideXlab platform.

  • mixture model analysis of partially rank ordered set samples age groups of fish from length frequency data
    Scandinavian Journal of Statistics, 2015
    Co-Authors: Armin Hatefi, Mohammad Jafari Jozani, Omer Ozturk
    Abstract:

    type="main" xml:id="sjos12140-abs-0001"> We present a novel methodology for estimating the parameters of a finite mixture model (FMM) based on partially rank-ordered set (PROS) sampling and use it in a fishery application. A PROS sampling design first selects a simple random sample of fish and creates partially rank-ordered judgement subsets by dividing units into subsets of prespecified sizes. The final measurements are then obtained from these partially ordered judgement subsets. The traditional ExpectationMaximization Algorithm is not directly applicable for these observations. We propose a suitable ExpectationMaximization Algorithm to estimate the parameters of the FMMs based on PROS samples. We also study the problem of classification of the PROS sample into the components of the FMM. We show that the maximum likelihood estimators based on PROS samples perform substantially better than their simple random sample counterparts even with small samples. The results are used to classify a fish population using the length-frequency data.

  • mixture model analysis of partially rank ordered set samples age groups of fish from length frequency data
    Scandinavian Journal of Statistics, 2015
    Co-Authors: Armin Hatefi, Mohammad Jafari Jozani, Omer Ozturk
    Abstract:

    We present a novel methodology for estimating the parameters of a finite mixture model (FMM) based on partially rank-ordered set (PROS) sampling and use it in a fishery application. A PROS sampling design first selects a simple random sample of fish and creates partially rank-ordered judgement subsets by dividing units into subsets of prespecified sizes. The final measurements are then obtained from these partially ordered judgement subsets. The traditional ExpectationMaximization Algorithm is not directly applicable for these observations. We propose a suitable ExpectationMaximization Algorithm to estimate the parameters of the FMMs based on PROS samples. We also study the problem of classification of the PROS sample into the components of the FMM. We show that the maximum likelihood estimators based on PROS samples perform substantially better than their simple random sample counterparts even with small samples. The results are used to classify a fish population using the length-frequency data.