17.2 Measuring Incremental Progress Toward Human-Level AGI 311 — Example task: Ask the robot about events that occurred at times when it got partic- ularly much, or particularly little, reward for its actions; it should be able to answer simple questions about these, with significantly more accuracy than about events occurring at random times 4, Learning e Imitation: Spontaneously adopt new behaviors that it sees others carrying out — Example task: Learn to build towers of blocks by watching people do it e Reinforcement: Learn new behaviors from positive and/or negative reinforcement signals, delivered by teachers and/or the environment — Example task: Learn which box the red ball tends to be kept in, by repeatedly trying to find it and noticing where it is, and getting rewarded when it finds it correctly ¢ Imitation/Reinforcement — Example task: Learn to play “fetch”, “tag” and “follow the leader” by watching people play it, and getting reinforced on correct behavior e Interactive Verbal Instruction — Example task: Learn to build a particular structure of blocks faster based on a combination of imitation, reinforcement and verbal instruction, than by imitation and reinforcement without verbal instruction e Written Media — Example task: Learn to build a structure of blocks by looking at a series of diagrams showing the structure in various stages of completion e Learning via Experimentation — Example task: Ask the robot to slide blocks down a ramp held at different angles. Then ask it to make a block slide fast, and see if it has learned how to hold the ramp to make a block slide fast. 5. Reasoning e Deduction, from uncertain premises observed in the world — Example task: If Ben more often picks up red balls than blue balls, and Ben is given a choice of a red block or blue block to pick up, which is he more likely to pick up? e Induction, from uncertain premises observed in the world — Example task: If Ben comes into the lab every weekday morning, then is Ben likely