+

Ng et al., 2013 - Google Patents

PEGASUS: A policy search method for large MDPs and POMDPs

Ng et al., 2013

View PDF
Document ID
6314186060144292150
Author
Ng A
Jordan M
Publication year
Publication venue
arXiv preprint arXiv:1301.3878

External Links

Snippet

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a model. Our approach is based on the following observation: Any (PO) MDP can be …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50Computer-aided design
    • G06F17/5009Computer-aided design using simulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30386Retrieval requests
    • G06F17/30424Query processing
    • G06F17/30533Other types of queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/11Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/04Architectures, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/12Computer systems based on biological models using genetic models
    • G06N3/126Genetic algorithms, i.e. information processing using digital simulations of the genetic system
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F2217/00Indexing scheme relating to computer aided design [CAD]
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F2207/00Indexing scheme relating to methods or arrangements for processing data by operating upon the order or content of the data handled
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computer systems based on specific mathematical models
    • G06N7/005Probabilistic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models

Similar Documents

Publication Publication Date Title
Ng et al. PEGASUS: A policy search method for large MDPs and POMDPs
Agrawal et al. Abcd-strategy: Budgeted experimental design for targeted causal structure discovery
EP4006788A1 (en) Quantum circuit determining method and apparatus, device, and storage medium
Langville et al. Updating Markov chains with an eye on Google's PageRank
Murphy A variational approximation for Bayesian networks with discrete and continuous latent variables
CN111475848B (en) Global and local low-noise training methods for data privacy in edge computing
Ruta et al. A theoretical analysis of the limits of majority voting errors for multiple classifier systems
CN109214502B (en) Neural network weight discretization method and system
Shonkwiler et al. Parallel speed-up of monte carlo methods for global optimization
Ghosh et al. A FETI‐preconditioned conjugate gradient method for large‐scale stochastic finite element problems
Campos Pinto et al. Convergence of a linearly transformed particle method for aggregation equations
KR20220061835A (en) Apparatus and method for hardware acceleration
Hinkelmann et al. Inferring biologically relevant models: nested canalyzing functions
CN112395428A (en) Method and system for complementing knowledge graph entity abstract based on set
Du et al. Geometric matrix completion via Sylvester multi-graph neural network
González et al. Mathematical modeling of discrete estimation of distribution algorithms
Obenland et al. Simulating the effect of decoherence and inaccuracies on a quantum computer
Palaniswamy et al. Improved threshold logic synthesis using implicant-implicit algorithms
CN116579437B (en) Quantum circuit training method and device, storage medium and electronic device
Malji et al. Significance of entropy correlation coefficient over symmetric uncertainty on FAST clustering feature selection algorithm
CN114723167B (en) A short-term vehicle speed prediction method based on BiLSTM-RVFL model
Ertl et al. Design and optimisation of an efficient HDF5 I/O Kernel for massive parallel fluid flow simulations
CN107844461A (en) A kind of Gaussian process based on broad sense N body problems returns computational methods
Inthachot et al. A multi-subpopulation particle swarm optimization: a hybrid intelligent computing for function optimization
Burnaev et al. Adaptive design of experiments for sobol indices estimation based on quadratic metamodel
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载