Wolfram Technology Guide: High-Level String Computation  previous | next 
Automatic HTML Import
Mathematica can automatically extract data from raw HTML pages.
In[1]:=

Click for copyable input
Import["http://www.forbes.com/2003/09/17/rich400land.html", "Data"]
Out[1]=