Location of the xml file (just copy-paste the url underneath the crawl result)

ouput: full urls only hosts