The authors claim the company trained its XGen models on nearly 200,000 pirated books, then scrubbed public disclosures.