Hello,
I'm trying to get movie info from http://www.filmaffinity.com. I made a spider called filmaffinity.com.txt with this content:
limit=0
url=http://www.filmaffinity.com/es/search.php?stext=%searchstring%&stype=all
results=<b><a href="(?<url>/es/.*?)".*?>(?<display>.*?)</a>
to try it I searched for "terminator" movie but I can't see any of the five results I expected.
The code to look into with regex expression is:
<a class="addl" href="/es/edtmovielists.php?movie_id=517417&rp=%2Fes%2Fsearch.php%3Fstext%3Dterminator%26stype%3Dall">AƱadir a listas</a> <b><a href="/es/film517417.html">Terminator Salvation</a></b> (2009) <img src="/imgs/countries/US.jpg" title="Estados Unidos" border="0" align="middle"></td></tr>
May the problem be the url is relative /es/film517417.html instead of absolute http://www.filmaffinity.com/es/film517417.html ?
Can anybody help me?
Thanks in advance.