Skip to content Skip to sidebar Skip to footer
Showing posts with the label Screen Scraping

Xpath: "exclude" Tag In "innerhtml" (innerhtmlexcludeme

I am using XPath to query HTML sites, which works pretty good so far, but now I hit a (brick)wall a… Read more Xpath: "exclude" Tag In "innerhtml" (innerhtmlexcludeme

Some Help Scraping A Page In Java

I need to scrape a web page using Java and I've read that regex is a pretty inefficient way of … Read more Some Help Scraping A Page In Java

Web Scrape Live Chaning Data

I am pretty new to web scraping. It is pretty easy with static content, but I would like to know if… Read more Web Scrape Live Chaning Data

A Php Html Parser That Lets Me Do Class Select And Get Parent Nodes

So I'm in a situation where I am scraping a website with PHP and I need to be able to get a nod… Read more A Php Html Parser That Lets Me Do Class Select And Get Parent Nodes

Python Lxml.html Xpath "attribute Not Equal" Operator Not Working As Expected

I'm trying to run the following script: #!python from urllib import urlopen #urllib.request fo… Read more Python Lxml.html Xpath "attribute Not Equal" Operator Not Working As Expected

Scraping Hidden Html (when Visible = False) Using Hpricot (ruby On Rails)

I've come across an issue which unfortunately I can't seem to surpass, I'm also just a … Read more Scraping Hidden Html (when Visible = False) Using Hpricot (ruby On Rails)