Show HN: Beating Pokemon Red with RL and <10M Parameters
(drubinstein.github.io)
COMMENTS:
I think they allude to this in their conclusion, but it's less about the low-hanging fruit and more about designing a system to feedback game dialogue into the RL decision making process in a way that can be mutated as part of the RL(be it an LLM or something else)
This is what I want to see more of and goes against the hype of LLMs. What a great RL project.
Meanwhile, "Claude" is still stuck somewhere in the game. Imagine the costs of running that vs this project.
brains basically have “modules” like this as well - neuronal columns that handle specialised tasks. For example when you’re driving on the road, the understanding whether the distance between you and the vehicle in front is increasing or decreasing is a finely tuned function of a specialised part of the brain.
(and how on earth did you port Pokémon red to a RL environment? O.o)
> (and how on earth did you port Pokémon red to a RL environment? O.o)
Read and find out :)
1. Doing things humans do for fun. 2. Doing things that AI is horribly terrible at.
?
Autonomous drones
Financial fraud detection
Scheduling of trains/buses/etc
I personally do like chatbots but you probably don't
I wonder, does anyone have a sense of the approximate raw number of button presses required to beat the game? Mostly curious to see how that compares to the parameter count.
Maybe some day the “rival” character in Pokemon can be played by a RL system, haha. That way you can have a “real player (simulated)” for your rival.
I know all about rl. Ive read go-explore 1/2, and I have personally implemented intrinsic curiosity.
I was just commenting on what rhe other person said, which is that it would be cool to have the npcs be agents that battle and train too, to which you said they could not be made to, to which I say, we have the technology. :)
Seriously? I've never really played video games, but i remember spending so much time on pokemon red when i was young. Not sure if i ever really finished more than once. But i'm pretty sure i must have played for more than 50h or so before even close to finish. My memory might trick me though.
Not sure which pokemon version it was, but i got so hooked trying to get this "secret" pokemon which was just a bunch of pixels. Some kind of bug (of the game, not the type of pokemon). You had to do specific things in a park and other things and then surf up and down x-times on the right shore of an island... or something like that. I had no idea how it worked and got so hooked, i must have spent most of my playing time on things like that.
Oh boy, memories...
The glitched Pokemon you're talking about is Missingno by the way! I remember surfing up and down Cinnabar Island to do the same thing.
item_43269330