Developing an Amazon Movie Scraper

gyrene2083 Offline
Senior Member
Posts: 200
Joined: Oct 2008
Reputation: 0
Location: New York City
Post: #31
jelockwood,

I am looking forward to testing your scraper out. I have been following this thread since Oct. I have found that many of the DVDs on IMDB don't have cover art, whereas Amazon does, and are also missing DVD information. I appreciate all your efforts, and I am looking forward to testing this out.

Advice on Hardware
PC-Shuttle XPC SP35P2 CPU-Intel E8400 3Ghz RAM-4GB Video Card-Nvidia 9400GT
OS-Win7 32bit - XBMC PRE-11.0 Git:20110417-1f1bbfa (Compiled:Jul 2 2011) SKIN - Aeon NOX Keyboard - DiNovo Mini

-Semper Fi
gyrene2083
jelockwood Offline
Senior Member
Posts: 111
Joined: Mar 2008
Reputation: 0
Post: #32
gyrene2083 Wrote:
jelockwood,

I am looking forward to testing your scraper out. I have been following this thread since Oct. I have found that many of the DVDs on IMDB don't have cover art, whereas Amazon does, and are also missing DVD information. I appreciate all your efforts, and I am looking forward to testing this out.

If the only problem is cover art, then you could use the IMDB scraper, and manually select a local picture, or put a picture in the directory with a .tbn file extension. I wrote the scrapers because some titles are not listed at all on IMDB and I still wanted to include them in the XBMC library.

The download link is now live so you can give it a go.
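
For anyone who wants to try the .tbn route mentioned above, here is a minimal Python sketch. It assumes the thumbnail should sit next to the video file and share its base name; that naming rule and the `install_thumbnail` helper are assumptions for illustration, so check the XBMC manual for the exact convention.

```python
import shutil
from pathlib import Path


def install_thumbnail(video_path, picture_path):
    """Copy a local picture next to a video file, renamed to <video name>.tbn.

    Assumption: the thumbnail uses the video's base name with a .tbn
    extension and lives in the same directory as the video.
    """
    video = Path(video_path)
    target = video.with_suffix(".tbn")  # e.g. "Some Film.avi" -> "Some Film.tbn"
    shutil.copyfile(picture_path, target)  # the original picture is left untouched
    return target
```

Calling `install_thumbnail("Some Film.avi", "cover.jpg")` would leave a `Some Film.tbn` copy beside the video.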
spiff Offline
Grumpy Bastard Developer
Posts: 12,233
Joined: Nov 2003
Reputation: 82
Post: #33
both are now sitting in svn (r16563). cheers again!

Always read the XBMC online-manual, FAQ and search the forum before posting.
Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules.
For troubleshooting and bug reporting please make sure you read this first.
jelockwood Offline
Senior Member
Posts: 111
Joined: Mar 2008
Reputation: 0
Post: #34
For the first time in several weeks, I just tried using the Amazon scrapers that I mostly wrote, and damn, they don't work for me any more.

Currently neither is finding any results (so it is not simply an issue of scraping info from a selected result). This was the original problem that I had (constructing a correct query in the scraper, and then getting/showing the list of results). This was originally solved by C-Quel generously providing his original Amazon scraper effort which I then finished off.

Could anyone else confirm whether the Amazon scrapers (either US or UK) are currently still working for them, and if so, what DVD title they used successfully?

If on the other hand, other users confirm it is broken, would anyone be able to assist in diagnosing it?

What held me up last time was that, without a LAN packet sniffer, I could not see what request the scraper sent out or what result it got back from Amazon, so I could not tell how far it got. Once I got past this and moved on to scraping the film info, testing was easy: I could just see how many fields successfully returned results.
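
One way to get that visibility without a packet sniffer is to reproduce the two steps outside XBMC: print the search URL that would be requested, then run the search expression over a saved copy of the response page. The Python sketch below is only an illustration; the URL template, `build_search_url`, and `captured_results` are hypothetical, since the scraper's real query URL is not shown in this thread.

```python
import re
from urllib.parse import quote_plus

# Hypothetical URL template; substitute whatever the scraper actually requests.
SEARCH_TEMPLATE = "https://example.invalid/search?keywords={query}"


def build_search_url(title):
    """Return the search URL for a title, URL-encoded, so it can be inspected."""
    return SEARCH_TEMPLATE.format(query=quote_plus(title))


def captured_results(html, pattern):
    """Return everything the search expression captures from a saved page.

    An empty list means the expression no longer matches the page markup,
    which points at the search step rather than the details step.
    """
    return re.findall(pattern, html)


if __name__ == "__main__":
    print(build_search_url("The Third Man"))
    # Stand-in for a saved response page.
    saved_page = '<a href="/dp/X1">Film A</a>'
    print(captured_results(saved_page, r'<a href="([^"]*)">([^<]*)</a>'))
```

If the URL looks right but nothing is captured, the page layout has probably changed and the expression needs updating.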
C-Quel Offline
Retired Team-XBMC Member
Posts: 1,378
Joined: Aug 2004
Reputation: 0
Post: #35
Try this...

change Get SearchResults from

imageColumn"[^:]*a href="([^"]*)"[^:]*[^>]*alt="([^"]*)"

to

productTitle"><a href="([^"]*)"> ([^<]*)</a>

or properly formatted

productTitle&quot;&gt;&lt;a href=&quot;([^&quot;]*)&quot;&gt; ([^&lt;]*)&lt;/a&gt;

Might not be perfect, as I simply glanced at Amazon with no tools to hand.
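
The suggested change can be checked offline before editing the scraper. The Python sketch below runs both expressions over a made-up line of markup and produces the XML-escaped form of the new one. The sample HTML is hypothetical (it is not copied from Amazon), and Python's `re` module may not behave identically to XBMC's regex engine in every case, so treat this as a sanity check only.

```python
import re
from xml.sax.saxutils import escape

OLD = r'imageColumn"[^:]*a href="([^"]*)"[^:]*[^>]*alt="([^"]*)"'
NEW = r'productTitle"><a href="([^"]*)"> ([^<]*)</a>'

# Hypothetical markup in the shape the new expression expects.
SAMPLE = (
    '<div class="productTitle">'
    '<a href="http://example.invalid/dp/B000TEST"> Some Film [DVD]</a>'
    '</div>'
)


def search_results(pattern, html):
    """Return the (url, title) pairs captured by the expression."""
    return re.findall(pattern, html)


def for_scraper_xml(pattern):
    """Escape &, <, > and double quotes so the expression can sit in the XML file."""
    return escape(pattern, {'"': "&quot;"})


if __name__ == "__main__":
    print(search_results(OLD, SAMPLE))  # old expression finds nothing here
    print(search_results(NEW, SAMPLE))  # new expression captures url and title
    print(for_scraper_xml(NEW))
```

The escaped output is the "properly formatted" string given above.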

If scraper related please always grab the latest XML relevant to the content you are trying to grab info for from this link https://xbmc.svn.sourceforge.net/svnroot...m/scrapers

System Specs:

A Computer with loads of shiny things that make a noise and bring life to my tv, and xbmc ofc :)

jelockwood Offline
Senior Member
Posts: 111
Joined: Mar 2008
Reputation: 0
Post: #36
I have already thanked C-Quel (again) via a private message, but this fix does so far look successful. I will do some more testing and then put updated versions on my download page and issue a request for them to be included as updated and fixed versions in XBMC.

Many thanks again to C-Quel and everyone else who has helped out in the past.

vdrfan Offline
Team-XBMC Developer
Posts: 2,793
Joined: Jan 2008
Reputation: 7
Location: Germany
Post: #37
jelockwood Wrote:
I have already thanked C-Quel (again) via a private message, but this fix does so far look successful. I will do some more testing and then put updated versions on my download page and issue a request for them to be included as updated and fixed versions in XBMC.

Many thanks again to C-Quel and everyone else who has helped out in the past.

Please use our tracker instead and attach a unified diff against the previous scraper.
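
If no diff tool is to hand, a unified diff can be produced with Python's standard `difflib` module. This is only a sketch: the `scraper_diff` helper and the `amazon.xml` file name are hypothetical, and any tool that emits unified diffs would do the same job.

```python
import difflib


def scraper_diff(old_text, new_text, name="amazon.xml"):
    """Return a unified diff between the previous and the fixed scraper text.

    `name` is a placeholder file name used only in the diff headers.
    """
    old_lines = old_text.splitlines(keepends=True)
    new_lines = new_text.splitlines(keepends=True)
    diff = difflib.unified_diff(
        old_lines, new_lines, fromfile="a/" + name, tofile="b/" + name
    )
    return "".join(diff)


if __name__ == "__main__":
    old = "line one\nold expression\n"
    new = "line one\nnew expression\n"
    print(scraper_diff(old, new), end="")
```

Read the two versions of the scraper file into strings, pass them in, and save the returned text as the attachment.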

ultrabrutal Offline
Posting Freak
Posts: 954
Joined: Feb 2005
Reputation: 0
Location: South of Heaven
Post: #38
Amazon does not give permission to get info via HTTP. They have a web service to use, which is legal; however, you have to delete the info after 3 months... hehe, this means that movies should automatically start to disappear from the library if they were scanned via an Amazon web service scraper ;)
spiff Offline
Grumpy Bastard Developer
Posts: 12,233
Joined: Nov 2003
Reputation: 82
Post: #39
both scrapers disabled in svn

nekrosoft13 Offline
Fan
Posts: 473
Joined: Dec 2008
Reputation: 1
Post: #40
ultrabrutal Wrote:
Amazon does not give permission to get info via HTTP. They have a web service to use, which is legal; however, you have to delete the info after 3 months... hehe, this means that movies should automatically start to disappear from the library if they were scanned via an Amazon web service scraper ;)

you gonna ruin everything