|
Page 1 of 1
|
[ 9 posts ] |
|
| Author |
Message |
|
Jean-Claude
Rookie
Joined: Mon Feb 13, 2012 5:17 am Posts: 7
|
 Q-Learning
Hello. Has anybody used Q-learning on the NXT? Are the memory and the CPU good enough for such an algorithm? I'd like some advice or tips before giving it a try. Thank you.
|
| Thu Mar 08, 2012 1:52 pm |
|
 |
|
magicode
Moderator
Joined: Tue Sep 14, 2010 9:19 pm Posts: 495
|
 Re: Q-Learning
Many people seem to have accomplished it. Take a look at this publication: http://www.cs.surrey.ac.uk/BIMA/People/ ... arning.pdf
_________________ sudo rm -rf /
|
| Thu Mar 08, 2012 2:23 pm |
|
 |
|
Jean-Claude
Rookie
Joined: Mon Feb 13, 2012 5:17 am Posts: 7
|
 Re: Q-Learning
Amazing speed!!! This document is very interesting and answers my question. Thank you very much.
|
| Thu Mar 08, 2012 2:41 pm |
|
 |
|
Jean-Claude
Rookie
Joined: Mon Feb 13, 2012 5:17 am Posts: 7
|
 Re: Q-Learning
For those interested in Q-learning, I just found this good tutorial: http://mnemstudio.org/path-finding-q-learning-tutorial.htm
|
| Thu Mar 08, 2012 2:54 pm |
|
 |
|
magicode
Moderator
Joined: Tue Sep 14, 2010 9:19 pm Posts: 495
|
 Re: Q-Learning
This probably doesn't contribute anything to the conversation, but it's sort of relevant, so I'll say it anyway. I was programming a simple Q-learning algorithm a while back, and I had this really weird moment when I kept thinking about robot ethics. I mean, in Q-learning, you're basically programming the robot with rewards and punishments. It's like giving it pain or pleasure depending on the action. (I wasn't inebriated, I promise.)
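For anyone wondering what "rewards and punishments" looks like in code, here is a minimal sketch of tabular Q-learning. It is not NXT code: the five-cell corridor, the reward values (+10 for reaching the goal, -10 for bumping the wall), and the constants ALPHA, GAMMA and EPSILON are all assumptions made up for illustration.

```python
import random

N_STATES = 5          # hypothetical corridor cells 0..4; cell 4 is the goal
ACTIONS = (-1, +1)    # index 0 = move left, index 1 = move right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2   # assumed learning rate, discount, exploration rate

def step(state, action):
    """Return (next_state, reward, done) for the toy corridor."""
    nxt = state + action
    if nxt < 0:
        return state, -10.0, False   # bumped the wall: punishment
    if nxt == N_STATES - 1:
        return nxt, 10.0, True       # reached the goal: reward
    return nxt, 0.0, False

def train(episodes=200, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]   # one row per state, one column per action
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy choice: mostly exploit, sometimes explore.
            if rng.random() < EPSILON:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            nxt, reward, done = step(state, ACTIONS[a])
            # Q-learning update: move the estimate toward reward + discounted best next value.
            target = reward if done else reward + GAMMA * max(q[nxt])
            q[state][a] += ALPHA * (target - q[state][a])
            state = nxt
    return q

q_table = train()
policy = ["left" if left > right else "right" for left, right in q_table[:-1]]
print(policy)   # ['right', 'right', 'right', 'right']
```

The whole "brain" here is a table of 5 x 2 numbers, so the memory cost depends only on how many states and actions you define.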
|
| Thu Mar 08, 2012 9:36 pm |
|
 |
|
RoboDesigners
Novice
Joined: Sat Jul 10, 2010 3:06 pm Posts: 86 Location: Roanoke, VA
|
 Re: Q-Learning
Wow... you have interesting thoughts... For the sake of argument, though: even if it were an "ethical" question (I personally don't think robots are bound by the same moral code as humans), people do the same thing to other people. How does a parent train a child? By rewarding them when they do right, and by punishing them when they do wrong. So, why would it be wrong to punish and reward a robot? (This could get weird really quickly.) //Andrew
_________________ Check out my website! www.RoboDesigners.com VRC Team 2190 Twitter: @RoboDesigners
|
| Thu Mar 08, 2012 10:20 pm |
|
 |
|
magicode
Moderator
Joined: Tue Sep 14, 2010 9:19 pm Posts: 495
|
 Re: Q-Learning
Yes, interesting is a polite way to put it. And the punishment-reward system wasn't really the focus of what I was pondering. It was more about how easy it was to change one of those values. With one keystroke, the 'pain' value of a certain action goes up by 50. I know it's not really a question of ethics; it was just a thought experiment.
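To make the "one keystroke" point concrete, here is a small sketch of a single Q-learning update with two different penalty values. The numbers are assumptions for illustration only: a penalty of -10 versus -60 (that is, 50 more 'pain'), a learning rate of 0.5, and a discount factor of 0.9.

```python
ALPHA, GAMMA = 0.5, 0.9   # assumed learning rate and discount factor

def q_update(q_old, reward, best_next_q):
    """One tabular Q-learning update for a single state-action pair."""
    return q_old + ALPHA * (reward + GAMMA * best_next_q - q_old)

# Same situation, same action; only the penalty constant differs by 50.
mild = q_update(q_old=0.0, reward=-10.0, best_next_q=0.0)
harsh = q_update(q_old=0.0, reward=-60.0, best_next_q=0.0)
print(mild, harsh)   # -5.0 -30.0
```

Changing that one constant is all it takes to make the learned value of the action six times worse after a single step.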
|
| Thu Mar 08, 2012 11:35 pm |
|
 |
|
Jean-Claude
Rookie
Joined: Mon Feb 13, 2012 5:17 am Posts: 7
|
 Re: Q-Learning
In my opinion, rewards and punishments in education are not the problem. Even in self-learning you use rewards and punishments. When you succeed you're happy (reward); when you make a mistake you're sad or angry (punishment). The problem is the level of the punishment. All parents are confronted with this question. But, so far, I've never confused my NXT and my children. [I hope my English is understandable.]
|
| Fri Mar 09, 2012 5:33 am |
|
 |
|
RoboDesigners
Novice
Joined: Sat Jul 10, 2010 3:06 pm Posts: 86 Location: Roanoke, VA
|
 Re: Q-Learning
This is true... It is. (In fact, it's better than that of some native English speakers who have spent too much time texting on their phones...) //Andrew
|
| Fri Mar 09, 2012 9:29 am |
|
|
|