[RELEASE] FilmAffinity (Spanish) scraper

serieofilo Offline
Member
Posts: 65
Joined: Sep 2011
Reputation: 0
Location: Barcelona, Spain
Incorrect title when English title is between () Post: #371
Hello,

I've been using the FA scraper for some time and I've found the following inconsistency when downloading information for movies that have the English/Spanish name in parentheses. It only happens when the movie is not a video file on disk but a .disc file (a pointer to an external DVD disc).

The title ends up with extra blank spaces between the movie name and the English/Spanish name in parentheses.

For example, this pointer to a DVD gets incorrect information, with 3 spaces between "Jersey" and the "(":

Una chica de Jersey (Jersey Girl) (2004).dvd.disc

Code:
<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<movie>
    <title>Una chica de Jersey   (Jersey Girl)</title>
    <originaltitle>Jersey Girl</originaltitle>

This one gets correct information, with only 1 space between the name and the "(":

The Reader (El lector) (2008).avi

Code:
<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<movie>
    <title>The Reader (El lector)</title>
    <originaltitle>The Reader</originaltitle>

Any idea what the problem is?

Thank you.

Desktop & HTPC System:
Intel Core 2 Quad Q9450 @ 2.66GHz || 4 x 2GB DDR2-1066 Kingston @ 1066MHz
Sapphire HD 6450 1GB || Dell UltraSharp 2009W @ 1680x1050 (using DVI) || Philips LCD 42PFL8404H @ 1920x1080 (using HDMI through Yamaha SR-301)
Yamaha YHT-S401 (SR-301 Receiver + NS-BR301 Speaker)
Hauppauge WinTV-HVR-4000
pancheto Offline
Junior Member
Posts: 32
Joined: Nov 2011
Reputation: 0
Location: Santiago de Compostela
Post: #372
I've tried to replicate your issue, and although I found some interesting things, I haven't found any problem in the scraper: the search results don't depend on the filename or its extension at all, only on the film entry's FilmAffinity code.

The problem with Jersey Girl in particular, regardless of the file type or filename, is that its FilmAffinity entry does indeed have 3 spaces in its title:
Code:
<title>Una chica de Jersey   (Jersey Girl) (2004) - FilmAffinity</title>
The good news is that the scraper works fine and you're getting the right information stored in XBMC's database, so I would consider this a minor bug rather than an error. But if I understand correctly, you are suggesting correcting wrong FilmAffinity data on the fly as it is parsed. Since I don't know how to do that (it looks really simple, although I'm still new at scraper coding), I'll report this issue on GitHub in case another coder can include it in a future scraper version.
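
For anyone curious what that on-the-fly cleanup amounts to, here is a minimal sketch in Python. It is only an illustration of the idea, not the scraper's actual code (the scraper is not written this way, and `normalize_title` is a hypothetical name): it collapses any run of whitespace in the parsed title into a single space.

```python
import re


def normalize_title(raw_title: str) -> str:
    """Collapse runs of whitespace into one space and trim both ends.

    Hypothetical helper for illustration; not part of the FA scraper.
    """
    return re.sub(r"\s+", " ", raw_title).strip()


# Title as it appears in the FilmAffinity entry quoted above (3 spaces).
print(normalize_title("Una chica de Jersey   (Jersey Girl)"))
# A title that is already clean is returned unchanged.
print(normalize_title("The Reader (El lector)"))
```

Applying the cleanup to every title, instead of special-casing this film, would keep already-correct titles untouched while fixing any other entries with the same problem.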
pancheto Offline
Junior Member
Posts: 32
Joined: Nov 2011
Reputation: 0
Location: Santiago de Compostela
Post: #373
I've just addressed this problem (the extra spaces coming from FilmAffinity) on GitHub. I'm sure it will soon be committed to the master branch, and then to the official XBMC repository.
tonybeccar Offline
Member
Posts: 66
Joined: Nov 2010
Reputation: 0
Post: #374
Hello, I've been using the FA scraper ever since I started using XBMC, and one feature that I now see in the IMDB scraper is the only thing that, IMO, this script is missing! The IMDB scraper has an option to scrape the movie title based on a predefined country. This is really useful for me, and I assume for many others: for someone who doesn't live in Spain, some titles can be confusing, which means renaming lots of movies by hand.

So I'm asking: would it be possible to include this feature in the FA scraper? Maybe by copying it from the IMDB scraper?

Thanks in advance!!
pancheto Offline
Junior Member
Posts: 32
Joined: Nov 2011
Reputation: 0
Location: Santiago de Compostela
Post: #375
The main idea behind the FA scraper is indeed to search in FA. Although the scraper tries to enrich those search results with information from other sources (such as IMDB), an entry on FA can only be located by its Spanish title (logical, since this is a Spanish community) or by its original title. I know agjacome is working on a way to store the original title in the XBMC database instead of the Spanish one, but I don't think searching by titles in other languages should be the aim of the FA scraper.
itombs Offline
Senior Member
Posts: 142
Joined: Oct 2008
Reputation: 0
Post: #376
Hi, the FA scraper hasn't been working well since this morning.
The problem seems to be with IMDB and other resources such as "Actors with rol (IMDB)" and "FanArt".
It happens in both Dharma and Eden.
Could someone fix it?
Thanks a lot.
(This post was last modified: 2012-03-29 15:15 by itombs.)
serieofilo Offline
Member
Posts: 65
Joined: Sep 2011
Reputation: 0
Location: Barcelona, Spain
Post: #377
(2012-01-11 19:53)pancheto Wrote:  I've just addressed this problem (the extra spaces coming from FilmAffinity) on GitHub. I'm sure it will soon be committed to the master branch, and then to the official XBMC repository.

Sorry, I missed this post for a long time.

Thank you very much for your change. I'll try the latest FA scraper with my movies.

tonybeccar Offline
Member
Posts: 66
Joined: Nov 2010
Reputation: 0
Post: #378
(2012-03-28 09:52)pancheto Wrote:  The main idea behind the FA scraper is indeed to search in FA. Although the scraper tries to enrich those search results with information from other sources (such as IMDB), an entry on FA can only be located by its Spanish title (logical, since this is a Spanish community) or by its original title. I know agjacome is working on a way to store the original title in the XBMC database instead of the Spanish one, but I don't think searching by titles in other languages should be the aim of the FA scraper.

I understand your point of view, but the FA scraper serves the ENTIRE Spanish-speaking audience, and movie titles differ from region to region. What I'm asking for, if it's not too much trouble, is at least an option to choose the title. Besides, the scraper already connects to IMDB, so I think it should be easier: IMDB automatically shows the title for my country because it works it out from my IP, so when the scraper scrapes IMDB it will be looking at the title that is right for my country. The option could be something like "Show movie title based on your region", and it would simply fetch the IMDB main title, which already appears translated. I would like to see it implemented if it is not too much trouble.
Ovokx Offline
Senior Member
Posts: 135
Joined: Feb 2012
Reputation: 2
Location: Spain
Post: #379
Is this script working fine for you guys? Since the Eden release it always shows me a message saying it can't connect when it tries to scrape the newly found movies :(

XBMC 12 + Aeon Nox 4.1.7
Mac Mini 2009 + OS X 10.7.4 Lion + iOmega MiniMax Desktop + NewerTech miniStack
Sony Bravia 42" + Sony DAV-F200
(This post was last modified: 2012-04-22 12:54 by Ovokx.)
pancheto Offline
Junior Member
Posts: 32
Joined: Nov 2011
Reputation: 0
Location: Santiago de Compostela
Post: #380
(2012-04-01 07:50)tonybeccar Wrote:  I understand your point of view, but the FA scraper serves the ENTIRE Spanish-speaking audience, and movie titles differ from region to region. What I'm asking for, if it's not too much trouble, is at least an option to choose the title. Besides, the scraper already connects to IMDB, so I think it should be easier: IMDB automatically shows the title for my country because it works it out from my IP, so when the scraper scrapes IMDB it will be looking at the title that is right for my country. The option could be something like "Show movie title based on your region", and it would simply fetch the IMDB main title, which already appears translated. I would like to see it implemented if it is not too much trouble.
What you are asking for is a combination of scrapers, which in essence is a good idea, although not very easy to implement: you are suggesting doing a first search on a larger database and then taking the result (assuming it is unique, which is not always the case) to FA to perform the FA query. Any movie scraper works in almost the same way: it first looks for the main entry in the main database (usually by title only), and then, IF IT'S FOUND, it queries other databases to complete or enrich the data. Since this is the FA scraper, the main database it uses is FA, and the only titles stored there are the Spanish and the original ones. So if you want to use the FA scraper, you will have to stick to FA's restrictions.
I understand your concern, since a few of my local items aren't found on FA although IMDB has them (I can truly live with missing these ~5% of my local items). Unfortunately, there's no way in XBMC to search two different databases and fill your local database from one or the other, whether by prioritizing them or depending on which database has the information. If what you look for is always found on IMDB, then you should use IMDB as your main database, and hence the IMDB scraper. If you want to add FA information on top, you'll have to ask the IMDB scraper's programmers to include an FA enrichment function (or program it yourself).
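
To make the "main database first, enrichment only if found" flow concrete, here is a rough sketch in Python. Everything in it is an assumption for illustration: `PRIMARY_DB` and `SECONDARY_DB` are hypothetical in-memory stand-ins for FA and for an enrichment source, the secondary values are placeholders, and `scrape` is not a real XBMC or scraper function.

```python
from typing import Optional

# Hypothetical stand-in for the main database (FA in this thread).
PRIMARY_DB = {
    "jersey girl": {
        "title": "Una chica de Jersey (Jersey Girl)",
        "originaltitle": "Jersey Girl",
    },
}

# Hypothetical stand-in for an enrichment source; values are placeholders.
SECONDARY_DB = {
    "jersey girl": {"fanart": "placeholder.jpg"},
    "only on secondary": {"fanart": "placeholder2.jpg"},
}


def scrape(query: str) -> Optional[dict]:
    """Look up the main entry first; enrich it only if that lookup succeeds."""
    key = query.strip().lower()
    entry = PRIMARY_DB.get(key)
    if entry is None:
        # Not in the main database: the secondary source is never consulted.
        return None
    result = dict(entry)  # copy so the stand-in database is not modified
    result.update(SECONDARY_DB.get(key, {}))
    return result


print(scrape("Jersey Girl"))        # main entry plus the enrichment fields
print(scrape("Only on secondary"))  # None, even though the secondary has it
```

The second call shows the restriction described above: an item that exists only in the enrichment source is never returned, because the main lookup fails first.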

(2012-04-22 12:53)Ovokx Wrote:  Is this script working fine for you guys? Since the Eden release it always shows me a message saying it can't connect when it tries to scrape the newly found movies :(
I have seen a couple of "network error" messages lately, but the scraper works fine most of the time, so I'm pretty sure it's a matter of a particular query occasionally failing. If the problem persists you will have to enable debugging; with that debug log we should be able to help you solve it.