
Comment 2009 Reinforcement Learning Competition (Score 3, Interesting) 110

Just thought I'd point out that we also did this in the 2009 Reinforcement Learning Competition (I was the general chair):

We also used Infinite Mario Bros, but combined it with the RL-glue coding framework to make the interface easier. That way, a well-coded agent is automatically compatible with any other domain that is RL-glue compatible.
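To illustrate why a shared interface gives that kind of portability, here is a minimal Python sketch. The callback names (agent_start, agent_step, agent_end, env_start, env_step) loosely mirror RL-glue's conventions, but this is not the actual RL-glue codec API; the classes, the toy corridor domain, and the run_episode helper are all hypothetical and only show the idea that an agent written against the interface never needs to know which domain it is plugged into.

```python
import random


class RandomAgent:
    """Hypothetical agent: it only ever sees observations and rewards."""

    def __init__(self, num_actions, seed=0):
        self.num_actions = num_actions
        self.rng = random.Random(seed)

    def agent_start(self, observation):
        # First action of an episode.
        return self.rng.randrange(self.num_actions)

    def agent_step(self, reward, observation):
        # Every later action; a learning agent would update itself here.
        return self.rng.randrange(self.num_actions)

    def agent_end(self, reward):
        # Final reward of the episode; nothing to do for a random agent.
        pass


class CorridorEnv:
    """Toy stand-in domain (not Mario): walk from cell 0 to the last cell."""

    def __init__(self, length=5):
        self.length = length
        self.pos = 0

    def env_start(self):
        self.pos = 0
        return [self.pos]

    def env_step(self, action):
        # Action 1 moves right; any other action moves left (clamped at 0).
        if action == 1:
            self.pos = min(self.pos + 1, self.length - 1)
        else:
            self.pos = max(self.pos - 1, 0)
        terminal = self.pos == self.length - 1
        reward = 1.0 if terminal else -0.1
        return reward, [self.pos], terminal


def run_episode(agent, env, max_steps=1000):
    """Generic driver: works for any agent/environment pair with these methods."""
    observation = env.env_start()
    action = agent.agent_start(observation)
    total = 0.0
    for steps in range(1, max_steps + 1):
        reward, observation, terminal = env.env_step(action)
        total += reward
        if terminal:
            agent.agent_end(reward)
            return total, steps
        action = agent.agent_step(reward, observation)
    return total, max_steps
```

Swapping CorridorEnv for any other class with the same two methods requires no change to the agent or to run_episode, which is the property described above.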

The prizes were also comparable: ~$450 for the first place team, ~$250 for the second place team.

The results were interesting: far from developing interesting and novel RL algorithms, most competitors used clever feature engineering combined with dimensionality reduction to reduce the full Mario problem to a simpler one that could be solved efficiently using existing RL algorithms that are robust and well understood.
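As a rough illustration of that recipe (not any competitor's actual entry), the Python sketch below collapses a 20-tile raw observation into a single hand-crafted boolean feature and then runs plain tabular Q-learning on the reduced state. The toy runner domain, the reward values, and every name here are assumptions made up for the example.

```python
import random
from collections import defaultdict

ACTIONS = (0, 1)  # 0 = run, 1 = jump


def features(tiles):
    # Hand-crafted reduction: keep only whether the next tile is an obstacle.
    return (tiles[0] == 1,)


def step(tiles, action, rng):
    """Toy dynamics: the track scrolls one tile per step."""
    if tiles[0] == 1:
        reward = 1.0 if action == 1 else -1.0   # jump over it, or run into it
    else:
        reward = 0.5 if action == 0 else -0.5   # running beats a wasted jump
    next_tiles = tiles[1:] + [1 if rng.random() < 0.3 else 0]
    return next_tiles, reward


def train(steps=20000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Standard epsilon-greedy tabular Q-learning over the reduced state."""
    rng = random.Random(seed)
    q = defaultdict(float)
    tiles = [0] * 20
    state = features(tiles)
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)
        else:
            action = max(ACTIONS, key=lambda a: q[(state, a)])
        tiles, reward = step(tiles, action, rng)
        next_state = features(tiles)
        best_next = max(q[(next_state, a)] for a in ACTIONS)
        q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
        state = next_state
    return q


def greedy_action(q, state):
    return max(ACTIONS, key=lambda a: q[(state, a)])
```

The raw observation has 2^20 possible values, but the learner only ever sees two states, so a well-understood algorithm handles it easily. All of the problem-specific insight lives in features(), which is the point being made about where competitors spent their effort.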

One of the big lessons that we took away from this was that we haven't solved the mechanism design problem of competitions in AI. While Mario sounds like a good "grand challenge" problem for RL / AI, it turns out that simple heuristics work pretty well. I think this is a common problem for most of these competitions -- there's the Trading Agent Competition, there's Netflix, there's the General Game Playing Competition, etc. They all have the same goals, and they all have the same problem: competitors engineer algorithms to solve the competition, not to spur progress in general AI. These games are all a proxy for what we really care about (like the Turing test), and the proxy isn't perfect (like the Turing test).

I think the only way to get around this is to craft a domain that mimics the real world, because then if anyone "solves the competition," you've made progress on what you really care about.

It would be interesting to design a competition with these goals in mind. Maybe an extraordinarily complex simulator based on a physics engine (Bullet or Havok) would be a step in the right direction -- different objects with continuous, high-dimensional state spaces and complex material properties (some are soft, some are rigid, some break, etc.); interesting physical interactions between objects (collisions, joints, hinges, stacking, breaking, etc.); multiple levels of spatio-temporal abstraction (from low-level motor control to abstract tasks) and a strong vision component. Now that would be a cool competition!

David Wingate
wingated@mit.edu

The Internet

Researchers Warn of Possible BitTorrent Meltdown 294

secmartin writes "Researchers at Delft University warn that large parts of the BitTorrent network might collapse if The Pirate Bay is forced to shut down. A large share of the available torrents use The Pirate Bay as their tracker, and the other available trackers will probably be overloaded if all traffic is shifted to them. TPB is currently using eight servers for its trackers. According to the researchers, even trackerless torrents using the DHT protocol will face problems: 'One bug in a DHT sorting routine ensures that it can only "stumble upon success", meaning torrent downloads will not start in seconds or minutes if Pirate Bay goes down in flames.'"

Government

Cuba Launches Own Linux Variation 494

willclem writes "According to Reuters, it seems that Cuba has launched its own variation of Linux in order to fulfill its government's desire to replace Microsoft operating systems. 'Getting greater control over the informatic process is an important issue,' said Communications Minister Ramiro Valdes, who heads a commission pushing Cuba's migration to free software."

NASA

Large Caves Found on the Surface of Mars 191

David DelMonte writes "Space.com is reporting on the discovery of seven dark spots near the Equator on Mars. The thinking is that these are cave openings. The openings are the size of football fields, and one of them is thought to extend approximately 400 feet below the surface. 'The researchers hope the discovery will lead to more focused spelunking on Mars. "Caves on Mars could become habitats for future explorers or could be the only structures that preserve evidence of past or present microbial life," said Glenn Cushing of Northern Arizona University, who first spotted the black areas in the photographs.'"


