Learning conventions via social reinforcement learning in complex and open settings