 A (k-1)-moment optimal policy π is also k-moment optimal if and only if (-1)^(k+1) V_k(π) satisfies the optimal equation, where V_k(π) is the k-th moment of the total discounted reward when π is used.
 In this paper, we use the optimal equation with expected state transitions to approximate the optimal equation of the MDP average model, and give a general method for determining error bounds for the numerical solutions of the two optimal equations.
 In this paper, using a generalization of the fixed point theorem for contractions, we set up the optimal equation for the non-stationary MDP with the average criterion and give sufficient conditions under which optimal or ε-optimal policies exist.
 Among 2,555 selection index equations with different numbers of traits (1 ≤ number of traits ≤ 9), constructed in groups from 22 traits, the optimal equation consisted of three agronomic traits, i.e., length of growth duration, plant height, and grain number per panicle on the main stem.
 In this paper, we consider the denumerable-state-space non-stationary MDP average model with incomplete information. By transforming the model, we build an optimal equation (OE) for the non-stationary MDP average model with incomplete information, and give sufficient conditions under which a solution of the OE and ε-optimal (ε > 0) policies exist.
 optimality equation
 (3) When Q* = K + bc, if the optimality equation has a solution v(x) with period b, then never ordering is an optimal policy; otherwise, there is a modified (s, S) optimal policy.
 Finally, the relationship between the optimality of the policy π_0^∞ and the optimality equation is discussed.
 Based on performance potential theory and the Bellman optimality equation, it is easy to establish an optimality equation, which we call the performance-potential-based Bellman optimality equation, for both the average-cost and discounted-cost performance criteria.
 In this paper, we discuss Markov decision processes with finite state space and finite action space under target-level criteria; a dynamic programming optimality equation is given.
 optimality equations
 The optimality equations for the model are established. The existence of an ε-optimal (ε > 0) Markov policy is proved.
 Through the transformation of models, semi-Markov decision programming and the continuous-time MDP are each transformed into the discrete-time MDP with the optimality equations kept equivalent (the latter transformation even keeps the average objective function equivalent), so that most results for the discrete-time MDP can be extended to the other two MDP models.
 A nonstationary Markov decision process with average cost is investigated for a general state space. The result that establishes the average-cost optimality equations from the optimality equations of a complementary discounted model in the stationary case is extended to the nonstationary case. Using this result, the existence of an optimal policy is proved.
 In this paper, a non-stationary discounted Markov decision model with unbounded rewards is investigated, in which the discount factor β_t depends on the state and the action taken at the previous stage, generalizing the Markov decision model with a constant discount factor. Under some assumptions, the optimality equations are established and the existence of an ε-optimal Markov policy is proved.
 This paper first investigates finite-horizon non-Markovian decision processes, in which the transition probabilities depend on the history and no longer have the Markov property. The optimality equations for the model are established, and the existence of deterministic ε-optimal policies is proved. Finally, an algorithm for finding ε-optimal policies is obtained and its validity is proved.
 optimum equation
 Several environmental factors affecting milking (grain filling) were analysed by stepwise multiple regression, and an optimum equation was proposed for the main factors affecting grain weight.
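 In most of the MDP examples above, "optimal equation" and "optimality equation" appear to refer to the Bellman equation of the model. As a hedged illustration only, the standard discounted and average-reward forms for a finite MDP can be written as below. The symbols are assumptions introduced here and are not taken from any of the quoted papers: S (states), A (actions), r (one-step reward), p (transition probability), β (discount factor, 0 < β < 1), V* (optimal discounted value), ρ (optimal average reward) and h (bias function).

```latex
% Discounted-reward optimality equation (standard textbook form; notation assumed)
V^{*}(s) = \max_{a \in A} \Big[ r(s,a) + \beta \sum_{s' \in S} p(s' \mid s,a)\, V^{*}(s') \Big],
\qquad s \in S .

% Average-reward optimality equation (standard textbook form; notation assumed)
\rho + h(s) = \max_{a \in A} \Big[ r(s,a) + \sum_{s' \in S} p(s' \mid s,a)\, h(s') \Big],
\qquad s \in S .
```

 The non-stationary, incomplete-information and risk-sensitive models quoted on this page use their own variants of these equations, so the forms above are a reading aid rather than the equations of any one cited paper.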

 Choice of an optimal equation for poplar may be based on the contemplated aims.
 You can also maintain the cost-optimal equation and use fewer 'C40 nodes at the price of higher running time.
 optimality equation
 In this stochastic stopping model, we prove that there exists an optimal deterministic and stationary policy and that the optimality equation has a unique solution.
 It is shown that both value functions satisfy the optimality equation, and upper and lower bounds, as well as conditions for equality, are presented for these functions.
 Under a Lyapunov function condition, we show that stationary policies obtained from the average reward optimality equation are not only average reward optimal, but indeed sample path average reward optimal, for almost all sample paths.
 For the case of switching arms, only one of which creates rewards, we explicitly solve the average optimality equation and prove that a myopic policy is average optimal.
 We also establish a lexicographical policy improvement algorithm leading to Blackwell optimal policies, and the relation between such policies and the Blackwell optimality equation.
 optimality equations
 An analogy between the optimality equations and the governing equations for a set of certain static beams permits obtaining numerical solutions to the optimal control problem with the help of standard 'structural' FEM software.
 From the optimality equations which are provided in this paper, we translate the average variance criterion into a new average expected cost criterion.
 Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions
 The approach uses an analogy between the optimality equations for control in the time domain and the governing equations for a set of static beams in the spatial domain.
 Moreover, necessary and sufficient conditions are given so that the optimality equations have a bounded solution with an additional property.
 optimum equation
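 An extensive survey was made of the adaptive character and species ...
 On the basis of extensive survey and analysis, and using mathematical statistics, this paper determines that the main soil factors affecting tree growth in the Pearl River Delta are pH, total salt content, and water-table conditions. The optimal equation was selected by stepwise regression. Taxodium distichum, Taxodium ascendens, Casuarina equisetifolia, and Glyptostrobus pensilis are the main afforestation species for farmland shelterbelts in the Pearl River Delta. The optimum soil reaction is pH 6.1 for Taxodium distichum and Taxodium ascendens, and pH 7.0 to 8.0 for Casuarina equisetifolia and Glyptostrobus pensilis. All four species have some salt tolerance and still grow normally on saline soil with a total salt content as high as 0.38%. However, the growth rate of Taxodium distichum and Taxodium ascendens is negatively correlated with total soil salt. The moisture-tolerant species Taxodium distichum, Taxodium ascendens, and Glyptostrobus pensilis perform best at planting points 50 cm and 30 cm above the water surface, respectively. The Pearl River Delta can be divided into five site types: low-hill leached, plain freshwater, river-network tidal-irrigated, coastal acid-reverting, and coastal sandy. Suitable shelterbelt species are proposed for each site type, providing a basis for matching species to site.
 Wheat cultivars of different genotypes selected from different places showed, in a three-year experiment, a common pattern in the milking (grain-filling) stage: dry matter accumulated in the grain at a slow, then fast, then slow rate, following an S-shaped curve. Each cultivar had its own peak milking period, during which the flow of nutrients into the grain followed the exponential equation y = ab^x. Several environmental factors affecting milking were analysed by stepwise multiple regression, and an optimum equation was proposed for the main factors affecting grain weight. The relationship between source, sink, and flow during the milking stage was studied, and several cultivars with good milking characteristics were identified.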

