How to extract URLs from a file?
qubodup, January 3rd, 2009
grep -o ‘xmlUrl.*’ file.html | sed ‘s/href=”//’ | sed ‘s/”.*//’
In this case all href= links get extracted from a HTML file. I used it slightly changed (by replacing “href”) to extract links from an XML file.