Ng et al., 2013 - Google Patents
PEGASUS: A policy search method for large MDPs and POMDPsNg et al., 2013
View PDF- Document ID
- 6314186060144292150
- Author
- Ng A
- Jordan M
- Publication year
- Publication venue
- arXiv preprint arXiv:1301.3878
External Links
Snippet
We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a model. Our approach is based on the following observation: Any (PO) MDP can be …
- 241001596784 Pegasus 0 title description 12
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
- G06F17/30533—Other types of queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/38—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2217/00—Indexing scheme relating to computer aided design [CAD]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2207/00—Indexing scheme relating to methods or arrangements for processing data by operating upon the order or content of the data handled
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Ng et al. | PEGASUS: A policy search method for large MDPs and POMDPs | |
| Agrawal et al. | Abcd-strategy: Budgeted experimental design for targeted causal structure discovery | |
| EP4006788A1 (en) | Quantum circuit determining method and apparatus, device, and storage medium | |
| Langville et al. | Updating Markov chains with an eye on Google's PageRank | |
| Murphy | A variational approximation for Bayesian networks with discrete and continuous latent variables | |
| CN111475848B (en) | Global and local low-noise training methods for data privacy in edge computing | |
| Ruta et al. | A theoretical analysis of the limits of majority voting errors for multiple classifier systems | |
| CN109214502B (en) | Neural network weight discretization method and system | |
| Shonkwiler et al. | Parallel speed-up of monte carlo methods for global optimization | |
| Ghosh et al. | A FETI‐preconditioned conjugate gradient method for large‐scale stochastic finite element problems | |
| Campos Pinto et al. | Convergence of a linearly transformed particle method for aggregation equations | |
| KR20220061835A (en) | Apparatus and method for hardware acceleration | |
| Hinkelmann et al. | Inferring biologically relevant models: nested canalyzing functions | |
| CN112395428A (en) | Method and system for complementing knowledge graph entity abstract based on set | |
| Du et al. | Geometric matrix completion via Sylvester multi-graph neural network | |
| González et al. | Mathematical modeling of discrete estimation of distribution algorithms | |
| Obenland et al. | Simulating the effect of decoherence and inaccuracies on a quantum computer | |
| Palaniswamy et al. | Improved threshold logic synthesis using implicant-implicit algorithms | |
| CN116579437B (en) | Quantum circuit training method and device, storage medium and electronic device | |
| Malji et al. | Significance of entropy correlation coefficient over symmetric uncertainty on FAST clustering feature selection algorithm | |
| CN114723167B (en) | A short-term vehicle speed prediction method based on BiLSTM-RVFL model | |
| Ertl et al. | Design and optimisation of an efficient HDF5 I/O Kernel for massive parallel fluid flow simulations | |
| CN107844461A (en) | A kind of Gaussian process based on broad sense N body problems returns computational methods | |
| Inthachot et al. | A multi-subpopulation particle swarm optimization: a hybrid intelligent computing for function optimization | |
| Burnaev et al. | Adaptive design of experiments for sobol indices estimation based on quadratic metamodel |