Dual Formulation

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 4251 Experts worldwide ranked by ideXlab platform

N. D. Hari Dass - One of the best experts on this subject based on the ideXlab platform.

Enlu Zhou - One of the best experts on this subject based on the ideXlab platform.

  • Information Relaxation and Dual Formulation of
    2020
    Co-Authors: Fan Ye, Enlu Zhou
    Abstract:

    Information relaxation and Duality in Markov deci- sion processes have been studied recently by several researchers with the goal to derive Dual bounds on the value function. In this paper we extend this Dual Formulation to controlled Markov diffusions: in a similar way we relax the constraint that the decision should be made based on the current information and impose a penalty to punish the access to the information in ad- vance. We establish the weak Duality, strong Duality and comple- mentary slackness results in a parallel way as those in Markov decision processes. We further explore the structure of the optimal penalties and expose the connection between the optimal penalties for Markov decision processes and controlled Markov diffusions. We demonstrate the use of this Dual representation in a classic dynamic portfolio choice problem through a new class of penalties, which require little extra computation and produce small Duality gap on the optimal value.

  • Dual Formulation of Controlled Markov Diffusions and Its Application
    IFAC Proceedings Volumes, 2020
    Co-Authors: Fan Ye, Enlu Zhou
    Abstract:

    Abstract Information relaxation and Duality in Markov decision processes have been studied recently to derive upper bounds on the maximal expected reward (or lower bounds on the minimal expected cost). The idea is to relax the non-anticipativity constraint on the controls and impose a penalty to punish such a violation. In this paper we generalize this Dual approach to controlled Markov diffusions. We develop the weak Duality and strong Duality results, and explore the structure of the optimal penalty. We demonstrate the use of this Dual Formulation by computing upper bounds on the optimal expected utility in a dynamic portfolio choice problem.

  • Information Relaxation and Dual Formulation of Controlled Markov Diffusions
    IEEE Transactions on Automatic Control, 2015
    Co-Authors: Fan Ye, Enlu Zhou
    Abstract:

    Information relaxation and Duality in Markov decision processes have been studied recently by several researchers with the goal to derive Dual bounds on the value function. In this paper we extend this Dual Formulation to controlled Markov diffusions: in a similar way we relax the constraint that the decision should be made based on the current information and impose a penalty to punish the access to the information in advance. We establish the weak Duality, strong Duality and complementary slackness results in a parallel way as those in Markov decision processes. We further explore the structure of the optimal penalties and expose the connection between the optimal penalties for Markov decision processes and controlled Markov diffusions. We demonstrate the use of this Dual representation in a classic dynamic portfolio choice problem through a new class of penalties, which require little extra computation and produce small Duality gap on the optimal value.

  • information relaxation and Dual Formulation of controlled markov diffusions
    arXiv: Optimization and Control, 2013
    Co-Authors: Fan Ye, Enlu Zhou
    Abstract:

    Information relaxation and Duality in Markov decision processes have been studied recently by several researchers with the goal to derive Dual bounds on the value function. In this paper we extend this Dual Formulation to controlled Markov diffusions: in a similar way we relax the constraint that the decision should be made based on the current information and impose penalty to punish the access to the information in advance. We establish the weak Duality, strong Duality and complementary slackness results in a parallel way as those in Markov decision processes. We explore the structure of the optimal penalties and expose the connection between Markov decision processes and controlled Markov diffusions. We demonstrate the use of the Dual representation for controlled Markov diffusions in a classic dynamic portfolio choice problem. We evaluate the lower bounds on the expected utility by Monte Carlo simulation under a sub-optimal policy, and we propose a new class of penalties to derive upper bounds with little extra computation. The small gaps between the lower bounds and upper bounds indicate that the available policy is near optimal as well as the effectiveness of our proposed penalty in the Dual method.

Fan Ye - One of the best experts on this subject based on the ideXlab platform.

  • Information Relaxation and Dual Formulation of
    2020
    Co-Authors: Fan Ye, Enlu Zhou
    Abstract:

    Information relaxation and Duality in Markov deci- sion processes have been studied recently by several researchers with the goal to derive Dual bounds on the value function. In this paper we extend this Dual Formulation to controlled Markov diffusions: in a similar way we relax the constraint that the decision should be made based on the current information and impose a penalty to punish the access to the information in ad- vance. We establish the weak Duality, strong Duality and comple- mentary slackness results in a parallel way as those in Markov decision processes. We further explore the structure of the optimal penalties and expose the connection between the optimal penalties for Markov decision processes and controlled Markov diffusions. We demonstrate the use of this Dual representation in a classic dynamic portfolio choice problem through a new class of penalties, which require little extra computation and produce small Duality gap on the optimal value.

  • Dual Formulation of Controlled Markov Diffusions and Its Application
    IFAC Proceedings Volumes, 2020
    Co-Authors: Fan Ye, Enlu Zhou
    Abstract:

    Abstract Information relaxation and Duality in Markov decision processes have been studied recently to derive upper bounds on the maximal expected reward (or lower bounds on the minimal expected cost). The idea is to relax the non-anticipativity constraint on the controls and impose a penalty to punish such a violation. In this paper we generalize this Dual approach to controlled Markov diffusions. We develop the weak Duality and strong Duality results, and explore the structure of the optimal penalty. We demonstrate the use of this Dual Formulation by computing upper bounds on the optimal expected utility in a dynamic portfolio choice problem.

  • Information Relaxation and Dual Formulation of Controlled Markov Diffusions
    IEEE Transactions on Automatic Control, 2015
    Co-Authors: Fan Ye, Enlu Zhou
    Abstract:

    Information relaxation and Duality in Markov decision processes have been studied recently by several researchers with the goal to derive Dual bounds on the value function. In this paper we extend this Dual Formulation to controlled Markov diffusions: in a similar way we relax the constraint that the decision should be made based on the current information and impose a penalty to punish the access to the information in advance. We establish the weak Duality, strong Duality and complementary slackness results in a parallel way as those in Markov decision processes. We further explore the structure of the optimal penalties and expose the connection between the optimal penalties for Markov decision processes and controlled Markov diffusions. We demonstrate the use of this Dual representation in a classic dynamic portfolio choice problem through a new class of penalties, which require little extra computation and produce small Duality gap on the optimal value.

  • information relaxation and Dual Formulation of controlled markov diffusions
    arXiv: Optimization and Control, 2013
    Co-Authors: Fan Ye, Enlu Zhou
    Abstract:

    Information relaxation and Duality in Markov decision processes have been studied recently by several researchers with the goal to derive Dual bounds on the value function. In this paper we extend this Dual Formulation to controlled Markov diffusions: in a similar way we relax the constraint that the decision should be made based on the current information and impose penalty to punish the access to the information in advance. We establish the weak Duality, strong Duality and complementary slackness results in a parallel way as those in Markov decision processes. We explore the structure of the optimal penalties and expose the connection between Markov decision processes and controlled Markov diffusions. We demonstrate the use of the Dual representation for controlled Markov diffusions in a classic dynamic portfolio choice problem. We evaluate the lower bounds on the expected utility by Monte Carlo simulation under a sub-optimal policy, and we propose a new class of penalties to derive upper bounds with little extra computation. The small gaps between the lower bounds and upper bounds indicate that the available policy is near optimal as well as the effectiveness of our proposed penalty in the Dual method.

Frank Neumann - One of the best experts on this subject based on the ideXlab platform.

Myung-gon Yoon - One of the best experts on this subject based on the ideXlab platform.

  • Sign-weighted peak minimization problem for continuous-time systems
    Proceedings of the 2000 American Control Conference. ACC (IEEE Cat. No.00CH36334), 2000
    Co-Authors: Myung-gon Yoon
    Abstract:

    We study the problem of minimizing a parametrized convex combination of the overshoot and undershoot of SISO continuous time system in response to a known input. From a Dual Formulation we develop a condition for solution existence and specify the structure of optimal solution. In addition, an interrelation between the overshoot and undershoot in controller synthesis is analytically explained in our framework.

  • Signed-maximum minimization problem for SISO continuous-time systems
    Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171), 1998
    Co-Authors: Myung-gon Yoon, Ji-yoon Kang
    Abstract:

    In this paper, we study the problem of minimizing signed (negative or positive) maximum amplitude of a regulated output, due to a fixed input. Like L/sub /spl infin// (l/sub /spl infin//) problems, the optimal performance can be computed from Dual Formulation. However, from the lack of the alignment condition, the optimal solution can not be simply specified, if it exists. We also find suboptimal solutions to this problem and specify its form.