So, I’m working with Ermo on applying reinforcement learning to text based games. So, I was wondering if eventually if our method works if we could do text based learning from demonstration with reinforcement learning? Basically instead of the user pressing buttons they would describe what they wanted the system to do using english sentences. The user could then be able to say yes or no to what they are doing. Using natural language to train a multiagent system seems like it would be better. Especially since once it works for text, it could naturally be extended to speech! Telling the robots what to do and what to pay attention to would be even better.
Author: Drew Wicke
Bounty Hunting on a forum
Interesting, I found someone talking about bounty hunting as a non-exclusive task allocation mechanism.
http://forums.ltheory.com/viewtopic.php?t=2477&p=38248
Coke Robot
So, many of the people at my robotics lab buy soda from the vending machine. They just increased the price for a drink from 1.50 to 1.75! That is paying 10.50 for a 6 pack! So, we were saying we should just buy a bunch of soda when it goes on sale. So, we of course could make our own dispensary.
Would be terribly fun to implement the bounty hunting task allocation for a Coke delivery robot on GMU’s campus. So, instead of going to a vending machine you would order on your phone and it would bring you your coke product. So you pay for the soda through the app and you get a qrcode that you then present to the robot and it will dispense the soda. Then you could buy a coke for someone and they would just show the code.
Of course this could be done with a ton of things. But since my campus only has coke product it would be really cool if we had this. It would be a great research problem too.
Inhaler
Wow, I took my inhaler a bit ago and I feel amazing now! I think for the past few weeks I must not have been getting enough oxygen. I’ve been tired and a bit slow. For the past few weeks I have been going to the gym regularly and running/ellipticalling/lifting. So, I’ve not been kind to my lungs. I should probably start taking my inhaler before going to the gym… That will probably help. What do you know, doctors are right haha. 🙂
Bounty Hunting and Cloud Robotics
AI and Creativity
So I just read an article stating that AI is nowhere near supplanting artists due to computers inability to “decide what is relevant”. I think that might be giving us AI researchers too much credit or going too soft on us. We have yet to develop non-noisy inputs in order to simulate the emotional and non-functional aspects of the brain. The closest we could get is to teach a computer based off of an FMRI of the brain while experiencing art/music etc. Somewhat simpler is being able to recognize emotions and correlate what is happening with that emotion. That is even more difficult. That is when we are at the point that the machine can put itself in “another’s shoes,” as it were. That is at an entirely different level than where we are at now. So, I don’t disagree with the author, I just think that she is just scratching the surface of what AI is unable to do currently, especially in a general, non-lab setting. However, I believe given better inputs (and of course better algorithms) that machines may develop human like emotions and ability to simulate others situations and thus develop a connection and be inspired to create art. But, I’m pretty sure that won’t happen in my time :(.
http://www.technologyreview.com/view/542281/artificial-creativity/
Lentils Recipe
Lentils recipe I came up with:
- cooked lentils
- herb rice
- orange chicken
- cooked broccoli
- seared pear (cooked with oil and honey alongside cashews and pecans)
- Seasoned with cinnamon, very little nutmeg, and curry.
Haven’t tried it yet but will next week hopefully. High in fiber, protein and vitamin C.
Politics and MAS
Politics seems like a good real world example of the multi-agent inverse problem and trying to get agents to coordinate at a massive (country) scale. Basically the multi-agent inverse problem is determining rules and behaviors at the low level that achieve a higher level objective. This problem is made more difficult because the low level behaviors of agents interact with each other causing possibly unexpected emergent behavior.
Another thing that politics has is hierarchies…
Mainly this was prompted by the article on laws that pertain to the constitution and how they are interpreted.
Puzzle Space
I wonder what the space of problems that we consider puzzles is like. I mean how big is it? What characteristics in general do they have? Does computational complexity correspond to how difficult the puzzle is? Puzzles usually require some degree of logic. So, I’d imagine that puzzles that are most difficult correspond to those that have you solve NP-hard problems without you realizing it. Can we use a language that describes puzzles in general, like maybe some form of logic, and then we can maybe automatically generate puzzles. Interesting especially when you consider multi-agent puzzles.
task resources
I think that I might talk a bit about: