2025 01 23

Are LLMs good at resolving coreference between people in complicated stories? Our benchmark paper, IdentifyMe, accepted to NAACL 2025 indicates not! arXiv