urls, 1 per line. First 500 chars will be extracted and the language will be detected from each