I'm good at regular expressions because of my job, so I'm considering getting into the scraper world to understand how the variables really work, which is what is stopping me from modifying this one.
the year option was the first thing I thought of when I opened the scraper code, but since I assumed that \1 contained the whole file name and that the year could not be split from it, I stopped there. if you give me a hint on how to split the year from the name, I will try learning scraper coding and maybe contribute to this great scraper.
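As a rough illustration of the "split the year from the name" idea, here is a minimal Python sketch. It is not scraper XML and not code from this scraper; the pattern and the function name `split_title_year` are hypothetical. It assumes the year is a four-digit number starting with 19 or 20 at the end of the name, optionally wrapped in brackets.

```python
import re

# Hypothetical pattern: a title, optional separators, then a 4-digit year
# (19xx or 20xx) at the very end, optionally wrapped in () or [].
NAME_WITH_YEAR = re.compile(
    r"^(?P<title>.+?)[\s._-]*[\(\[]?(?P<year>(?:19|20)\d{2})[\)\]]?\s*$"
)


def split_title_year(name):
    """Return (title, year); year is None when no trailing year is found."""
    match = NAME_WITH_YEAR.match(name)
    if match is None:
        return name.strip(), None
    # Dots and underscores are common word separators in file names.
    title = re.sub(r"[._]+", " ", match.group("title")).strip()
    return title, match.group("year")


print(split_title_year("Nixon (1995)"))
print(split_title_year("Blade.Runner.1982"))
```

The two named groups play the same role as separate capture groups (\1 and \2) would in a scraper expression: one holds the title, the other the year.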
[RELEASE] FilmAffinity (Spanish) scraper
pancheto
Junior Member Posts: 32 Joined: Nov 2011 Reputation: 0 Location: Santiago de Compostela |
2011-11-19 11:08
Post: #361
pancheto
Junior Member Posts: 32 Joined: Nov 2011 Reputation: 0 Location: Santiago de Compostela |
2011-11-21 02:22
Post: #362
after revising the code, I've come up with a few updates that improve the scraper's work. since it's still an alpha version (I'm not into proper scraper coding yet), I will only post the changed lines here for anyone to test. the first one is the main change, which works fantastically (it is able to get all of filmaffinity's info from ~700 file names without any error), and the other one is just a suggestion to bypass google search.
line 11:
Code: <RegExp input="$$1" output="<url>http://www.filmaffinity.com/es/advsearch.php?stype[]=title&fromyear=$$2&toyear=$$2&stext=\1</url>" dest="3">
Code: <RegExp input="$$1" output="<url>http://www.filmaffinity.com/es/search.php?stext=\1&amp;stype=none</url>" dest="3">
lines 124 to 138:
Code: <RegExp input="$$9" output="<url function="GoogleToIMDB">http://www.imdb.com/search/title?year=$$6&title=$$9</url>" dest="5+">
Code: <RegExp input="$$9" output="<url function="GoogleToIMDB">http://www.google.com/search?q=site:imdb.com\1</url>" dest="5+">
note: the search suggestion tries to bypass google search not because google performs badly (on the contrary, it performs much better than my suggestion), but because when google detects a few hundred searches with the same structure coming from the same IP, it blocks the results page (after ~200 automatic searches it asks for a captcha), which makes updating an entire library of hundreds of files very complicated. I have tried to get around this by sending the queries to http://www.google.es/ or even https://www.google.com/ instead of http://www.google.com/, but without luck.
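To make the two URL shapes above easier to compare, here is a small Python sketch that builds them from a title and a year. The URLs and parameter names are taken from the lines quoted above; the function names are hypothetical, and whether the sites still accept these parameters is an assumption.

```python
from urllib.parse import urlencode

FA_ADVSEARCH = "http://www.filmaffinity.com/es/advsearch.php"
FA_SEARCH = "http://www.filmaffinity.com/es/search.php"
IMDB_SEARCH = "http://www.imdb.com/search/title"


def filmaffinity_search_url(title, year=None):
    """Year-restricted advanced search when a year is known, plain search otherwise."""
    if year:
        params = [("stype[]", "title"), ("fromyear", year),
                  ("toyear", year), ("stext", title)]
        return FA_ADVSEARCH + "?" + urlencode(params)
    return FA_SEARCH + "?" + urlencode([("stext", title), ("stype", "none")])


def imdb_search_url(title, year):
    """Direct IMDb title search, avoiding the google site: query."""
    return IMDB_SEARCH + "?" + urlencode([("year", year), ("title", title)])


print(filmaffinity_search_url("Nixon", "1995"))
print(imdb_search_url("Nixon", "1995"))
```

Setting fromyear and toyear to the same value narrows the search to a single release year, which is what lets the title match land on the right film.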
(This post was last modified: 2011-11-21 03:10 by pancheto.)
pancheto
Junior Member Posts: 32 Joined: Nov 2011 Reputation: 0 Location: Santiago de Compostela |
2011-11-21 17:19
Post: #363
forget about my previous post, as I finally worked out how to bypass google's search limitations, which were stopping me from batch updating my entire library. in summary, I suggest only 2 modifications to the current filmaffinity v1.4.1 scraper, both very simple yet very useful:
I tested this modification with a library of ~700 file names and it worked like a charm, always pointing to the right filmaffinity movie and getting >90% of the movies' fanart. future improvements? sure, there are plenty, but what struck me as obvious was that some thumbnails were not being downloaded properly from filmaffinity. I can download them manually through xbmc, as it will suggest imdb's ones, but I was wondering why movies like Nixon (http://www.filmaffinity.com/es/film737736.html) don't get a thumbnail. in fact there's no lightbox over them, so the img code probably looks different. I guess I'll leave this to the proper scraper developers to debug, and to release a new scraper version including my 2 previous suggestions.
MaDDoGo
Senior Member Posts: 242 Joined: Sep 2009 Reputation: 1 Location: Sabadell (Barcelona) |
2011-11-22 22:58
Post: #364
Hi,
I looked at your modifications and (after some tests) merged them into the repo, so the scraper now includes them. Thanks for your time and the modifications.
pancheto
Junior Member Posts: 32 Joined: Nov 2011 Reputation: 0 Location: Santiago de Compostela |
2011-11-24 02:17
Post: #365
I have implemented a few improvements to filmaffinity poster searching on github, hoping that you'll find them useful.
(This post was last modified: 2011-11-24 02:34 by pancheto.)
itombs
Senior Member Posts: 142 Joined: Oct 2008 Reputation: 0 |
2011-11-25 23:23
Post: #366
Hi, the number of votes has not been working for a few days.
I updated to the MaDDoGo github version, but the number of votes still does not work. Please fix the number of votes. Thanks a lot.
pancheto
Junior Member Posts: 32 Joined: Nov 2011 Reputation: 0 Location: Santiago de Compostela |
2011-11-26 01:32
Post: #367
the scraper code looks for the number of votes between brackets, but filmaffinity is currently reworking its look and the number now appears without them. I'll report this to MaDDoGo, hoping to have it solved in the next version.
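To illustrate the kind of fix this needs, here is a minimal Python sketch of a pattern that accepts the vote count with or without brackets. The sample strings and the word "votos" after the number are assumptions made up for illustration, not copied from filmaffinity's pages, and this is not the expression used in the actual scraper.

```python
import re

# Hypothetical pattern: optional "(" before the number, optional ")" after it,
# then the word "votos". Thousands separators (. or ,) are allowed.
VOTES = re.compile(r"\(?(\d[\d.,]*)\)?\s+votos")


def parse_votes(text):
    """Return the vote count as an int, or None when no count is found."""
    match = VOTES.search(text)
    if match is None:
        return None
    # Drop thousands separators before converting.
    return int(re.sub(r"[.,]", "", match.group(1)))


print(parse_votes("(12.345 votos)"))
print(parse_votes("12.345 votos"))
```

Making the brackets optional instead of required means the same expression keeps working whichever layout the site ends up with.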
itombs
Senior Member Posts: 142 Joined: Oct 2008 Reputation: 0 |
2011-12-01 19:02
Post: #368
When could the problem with the number of votes be fixed?
Is there any news about this?
pancheto
Junior Member Posts: 32 Joined: Nov 2011 Reputation: 0 Location: Santiago de Compostela |
2011-12-01 21:40
Post: #369
the fix has already been submitted to XBMC's main repository, and you should see it on your system as version 1.4.3. if the addon doesn't update automatically, try updating it manually or downloading the code from MaDDoGo's github repository.
itombs
Senior Member Posts: 142 Joined: Oct 2008 Reputation: 0 |
2011-12-02 01:43
Post: #370
pancheto Wrote: the fix has already been submitted to XBMC's main repository, and you should see it on your system as version 1.4.3. if the addon doesn't update automatically, try updating it manually or downloading the code from MaDDoGo's github repository.
Thanks a lot. Works well. See you.


