What WebArena teaches about evaluating autonomous web agents in complex, stateful online environments.