IMDB scraper issue with some Italian titles

  Thread Rating:
  • 1 Votes - 5 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Post Reply
gspinoza Offline
Junior Member
Posts: 12
Joined: May 2011
Reputation: 0
Post: #1
Good morning to everybody,

I'd like to report an issue that is happening with the IMDB scraper. When I set "Italy" in the scraper's preferences, in some cases it retrieves the international titles and not the Italian ones, even if in the IMDB page the Italian title is present. A couple of examples: "Le salaire de la peur" (1953) - it even has two Italian titles "Vite vendute" and "Il salario della paura", but none of them is retrieved; "Das weiße Band" (2009), whose Italian title is "Il nastro bianco", but the scraper keeps downloading the German one.

If I set, for instance, Brazil as country preferences, the scraper correctly retrieves Portuguese titles.
find quote
olympia Offline
Team-XBMC Member
Posts: 2,381
Joined: May 2008
Reputation: 30
Post: #2
Delete line 192-197 from imdb.xml
find quote
gspinoza Offline
Junior Member
Posts: 12
Joined: May 2011
Reputation: 0
Post: #3
Which version? In my version lines 192-197 are across two subsections and if I delete them nothing works
find quote
olympia Offline
Team-XBMC Member
Posts: 2,381
Joined: May 2008
Reputation: 30
Post: #4
228-233 in the latest imdb scraper.
find quote
gspinoza Offline
Junior Member
Posts: 12
Joined: May 2011
Reputation: 0
Post: #5
I have version 2.2.2 but it only has 205 lines...

EDIT: I was looking in the wrong directory. Thanks indeed, this works!
(This post was last modified: 2011-06-22 22:20 by gspinoza.)
find quote
gspinoza Offline
Junior Member
Posts: 12
Joined: May 2011
Reputation: 0
Post: #6
It worked but some Italian movies are then retrieved with their international title: for instance "La prima cosa bella" is retrieved as "The first beautiful thing"... ?!?
find quote
olympia Offline
Team-XBMC Member
Posts: 2,381
Joined: May 2008
Reputation: 30
Post: #7
Well, that's because there is no Italian title listed on imdb...

The lines you deleted were forcing the imdb display title if the movie was filmed in Italy.
find quote
gspinoza Offline
Junior Member
Posts: 12
Joined: May 2011
Reputation: 0
Post: #8
I think I'll revert to the official scraper. Now every Italian movie is retrieved with its international title...

What I do not understand, about the official scraper, is that some foreign movies are correctly retrieved by their Italian names, and some others with a title in another language, even if both have, on IMDB, the Italian alias...
find quote
olympia Offline
Team-XBMC Member
Posts: 2,381
Joined: May 2008
Reputation: 30
Post: #9
How about reading my explanation above?

Edit:
Try to replace line 229 in the official scraper with this one and check if it brings better results

Code:
<expression>&gt;&lt;a href=&quot;/country/[^&gt;]+&gt;($INFO[akatitles]&lt;/a&gt;&lt;/div&gt;)</expression>
(This post was last modified: 2011-06-24 09:34 by olympia.)
find quote
gspinoza Offline
Junior Member
Posts: 12
Joined: May 2011
Reputation: 0
Post: #10
olympia Wrote:How about reading my explanation above?
I read it but it does not explain. Some examples:
- French movie "Le salaire de la peur". Film has an Italian alias "Vite vendute". It was not filmed in Italy. Retrieved as "Le salaire de la peur".
- American movie "The ten commandments" (1923). Film has an Italian alias "I dieci comandamenti". It was not filmed in Italy. Retrieved as "I dieci comandamenti".

So it does not explain why, starting from the same initial conditions, results are different.

Quote:Edit:
Try to replace line 229 in the official scraper with this one and check if it brings better results

Code:
<expression>&gt;&lt;a href=&quot;/country/[^&gt;]+&gt;($INFO[akatitles]&lt;/a&gt;&lt;/div&gt;)</expression>

I'll give it a try and let you know.
find quote
Post Reply