A Study on Abstract Policy for Acceleration of Reinforcement Learning
Reinforcement learning (RL) is well known as one of the methods that can be applied to unknown problems. However, because optimization at every state requires trial-and-error, the learning time becomes large when environment has many states. If there exist solutions to similar problems and they are...
| Main Authors: | Ahmad Afif, Mohd Faudzi, Hirotaka, Takano, Junichi, Murata |
|---|---|
| Format: | Conference or Workshop Item |
| Language: | English |
| Published: |
2014
|
| Subjects: | |
| Online Access: | http://umpir.ump.edu.my/id/eprint/7452/ http://umpir.ump.edu.my/id/eprint/7452/1/A_Study_on_Abstract_Policy_for_Acceleration_of_Reinforcement_Learning.pdf |
Similar Items
A study on Visual Abstraction for Reinforcement Learning Problem Using Learning Vector Quantization
by: Ahmad Afif, Mohd Faudzi, et al.
Published: (2013)
by: Ahmad Afif, Mohd Faudzi, et al.
Published: (2013)
Transfer learning through policy abstraction using learning vector quantization
by: Ahmad Afif, Mohd Faudzi, et al.
Published: (2018)
by: Ahmad Afif, Mohd Faudzi, et al.
Published: (2018)
Transfer learning through abstraction using learning vector quantization
by: Ahmad Afif, Mohd Faudzi, et al.
Published: (2017)
by: Ahmad Afif, Mohd Faudzi, et al.
Published: (2017)
Policy abstraction for transfer learning using learning vector quantization in reinforcement learning framework
by: Ahmad Afif, Mohd Faudzi
Published: (2015)
by: Ahmad Afif, Mohd Faudzi
Published: (2015)
Accelerating Erasure Code with Multi-Processing
by: Loh, Hong Khai
Published: (2016)
by: Loh, Hong Khai
Published: (2016)
Accelerating graph algorithms with priority queue processor
by: Heng Sun, Ch'ng, et al.
Published: (2006)
by: Heng Sun, Ch'ng, et al.
Published: (2006)
Experimental Investigation on Vegetative Oils under Accelerated Thermal Ageing against Their Dielectric Strength
by: Siti Sufiah, Abd Wahid, et al.
Published: (2017)
by: Siti Sufiah, Abd Wahid, et al.
Published: (2017)
Accelerated Verilog Simulator Using Application Specific Microprocessor
by: Tan Tze Sin, Tze Sin
Published: (2017)
by: Tan Tze Sin, Tze Sin
Published: (2017)
Public transport route optimization with reinforcement learning
by: Tay, Bee Sim
Published: (2023)
by: Tay, Bee Sim
Published: (2023)
Reinforcement learning approach for centralized cognitive radio systems
by: Yau, Alvin Kok-Lim *
Published: (2012)
by: Yau, Alvin Kok-Lim *
Published: (2012)
A Numerical Approach to the Efficient Analysis of2D RF-MEMS Capacitor with Accelerated Motion
by: Shafrida, Sahrani, et al.
Published: (2009)
by: Shafrida, Sahrani, et al.
Published: (2009)
Efficient accelerated simulation technique for packet switched networks : a buffer with two priority inputs
by: Ariffin, Sharifah H. S., et al.
Published: (2004)
by: Ariffin, Sharifah H. S., et al.
Published: (2004)
An acceleration simulation method for power law priority traffic
by: H. S. Ariffin, Sharifah, et al.
Published: (2008)
by: H. S. Ariffin, Sharifah, et al.
Published: (2008)
Graph processing hardware accelerator for shortest path algorithms in nanometer very large-scale integration interconnect routing
by: Ch'ng, Heng Sun
Published: (2007)
by: Ch'ng, Heng Sun
Published: (2007)
Let’s tasmik mobile apps : Quran tasmik application using deep learning
by: Nor Nabilah, Mustapha, et al.
Published: (2021)
by: Nor Nabilah, Mustapha, et al.
Published: (2021)
Fpga-Based Accelerator for The Generation of Pseudo-Amino Acid Composition
by: Ching, Chee Chow
Published: (2015)
by: Ching, Chee Chow
Published: (2015)
Material characterization model using rectangular resonator waveguide
by: Mohamad Shaiful, Abdul Karim, et al.
by: Mohamad Shaiful, Abdul Karim, et al.
Simultaneous computation of model order and parameter estimation for system identification based on opposition-based simulated Kalman filter
by: Badaruddin, Muhammad, et al.
Published: (2018)
by: Badaruddin, Muhammad, et al.
Published: (2018)
Vision-based autonomous robot body alignment for copper wire spool pick up
by: Daud, Mohd Razali, et al.
Published: (2019)
by: Daud, Mohd Razali, et al.
Published: (2019)
A study on different techniques in ALPR system : The systems performance analysis
by: Vi, Gan Vi, et al.
Published: (2022)
by: Vi, Gan Vi, et al.
Published: (2022)
Implementation of generalized predictive control (GPC) for a real-time process control using labview
by: Mohd. Faudzi, Ahmad 'Athif
Published: (2006)
by: Mohd. Faudzi, Ahmad 'Athif
Published: (2006)
Conceptual design of a generic nodal abstraction (GNA) for a human-agent collaboration systems
by: Mohammed, Khudhair Abbas, et al.
Published: (2019)
by: Mohammed, Khudhair Abbas, et al.
Published: (2019)
Automatic detection of diabetic retinopathy retinal images using artificial neural network
by: Syamimi Mardiah, Shaharum, et al.
Published: (2019)
by: Syamimi Mardiah, Shaharum, et al.
Published: (2019)
Development of automated gate using automatic license plate recognition system
by: Luai Taha Ahmed, Al-Mahbashi, et al.
Published: (2018)
by: Luai Taha Ahmed, Al-Mahbashi, et al.
Published: (2018)
An automatic transfusion set for accelerating inoculation process of agarwood artificial inducer
by: Roslee, Muhammad Nurrifat, et al.
Published: (2018)
by: Roslee, Muhammad Nurrifat, et al.
Published: (2018)
Design of Ultra-Wideband (UWB) horn antenna for non-destructive fruit quality monitoring
by: Siti Fatihah, hazali, et al.
Published: (2018)
by: Siti Fatihah, hazali, et al.
Published: (2018)
Knowledge-based disk scheduling policy using fuzzy logic
by: Abu Talip, Mohamad Sofian, et al.
Published: (2010)
by: Abu Talip, Mohamad Sofian, et al.
Published: (2010)
Smart cities in India: Features, policies, current status, and challenges
by: Manoj Kumar, Nallapaneni, et al.
Published: (2018)
by: Manoj Kumar, Nallapaneni, et al.
Published: (2018)
On the policy of photovoltaic and diesel generation mix for an off-grid site: east malaysian perspectives
by: Ajan, Christopher W., et al.
Published: (2003)
by: Ajan, Christopher W., et al.
Published: (2003)
Design of T-shaped UWB antenna with dual band rejection using inverted u- and c-shaped slots
by: Salwa, Awang Akbar, et al.
Published: (2018)
by: Salwa, Awang Akbar, et al.
Published: (2018)
Malaysian vehicle license plate recognition using deep learning and computer vision
by: Pugalenthy, Kuken Raj, et al.
Published: (2022)
by: Pugalenthy, Kuken Raj, et al.
Published: (2022)
A development of a dielectric composite substrate based on barium titanate-epoxy resin for a 5 GHZ microstrip antenna
by: Nur Sofia Idayu, Didik Aprianto, et al.
Published: (2025)
by: Nur Sofia Idayu, Didik Aprianto, et al.
Published: (2025)
Effect of fibre loading on the flexural properties of natural fibre reinforced polymer composites
by: Mathivanan, Davindrabrabu, et al.
Published: (2015)
by: Mathivanan, Davindrabrabu, et al.
Published: (2015)
Electronic components detection and recognition using deep learning for learning purpose
by: Wan Nur Azhani, Wan Samsudin, et al.
Published: (2021)
by: Wan Nur Azhani, Wan Samsudin, et al.
Published: (2021)
Evaluation of different control policies of semi-active MR fluid damper of a quarter-car model
by: Rahman, Mahmudur, et al.
Published: (2012)
by: Rahman, Mahmudur, et al.
Published: (2012)
Cured epoxy resin dielectric characterization based on accurate waveguide technique
by: Nurulfadzilah, Hasan, et al.
Published: (2019)
by: Nurulfadzilah, Hasan, et al.
Published: (2019)
Determining faculty policy in franchising university program : evaluating lecturers' assessment capacity as a basis for student enrolment
by: Yahaya, Nazli, et al.
Published: (2007)
by: Yahaya, Nazli, et al.
Published: (2007)
Factorial analysis on the preparation of barium titanate-epoxy resin composite for antenna substrate
by: Nur Sofia Idayu, Didik Aprianto, et al.
Published: (2024)
by: Nur Sofia Idayu, Didik Aprianto, et al.
Published: (2024)
Traffic control strategy for adaptive signal controller based on reinforcement learning and local communication channel
by: Muaid, Abdulkareem Alnazir Ahmed
Published: (2023)
by: Muaid, Abdulkareem Alnazir Ahmed
Published: (2023)
Deep learning based human presence detection
by: Venketaramana, Balachandran, et al.
Published: (2020)
by: Venketaramana, Balachandran, et al.
Published: (2020)
Similar Items
-
A study on Visual Abstraction for Reinforcement Learning Problem Using Learning Vector Quantization
by: Ahmad Afif, Mohd Faudzi, et al.
Published: (2013) -
Transfer learning through policy abstraction using learning vector quantization
by: Ahmad Afif, Mohd Faudzi, et al.
Published: (2018) -
Transfer learning through abstraction using learning vector quantization
by: Ahmad Afif, Mohd Faudzi, et al.
Published: (2017) -
Policy abstraction for transfer learning using learning vector quantization in reinforcement learning framework
by: Ahmad Afif, Mohd Faudzi
Published: (2015) -
Accelerating Erasure Code with Multi-Processing
by: Loh, Hong Khai
Published: (2016)