You need to amend your xpath since not all td
elements have class="data"
.Try this xpath expression: //td//text()
.
import urllibfrom lxml import etreebudgeturl = "http://www.the-numbers.com/movie/budgets/all"s = urllib.urlopen(budgeturl).read()htmlpage = etree.HTML(s)htmltable = htmlpage.xpath("//td//text()")