Filmweb scraper

  Thread Rating:
  • 2 Votes - 3 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Post Reply
zxcvbn1971 Offline
Junior Member
Posts: 4
Joined: Mar 2009
Reputation: 0
Post: #51
I'm afraid this scraper does not work anymore. Only 30% of movies are correctly added to XBMC database. Remaining movies are added with errors (no title or description - famous doNotEscapeHTML) or not added at all....
Sad
(This post was last modified: 2009-05-15 10:19 by zxcvbn1971.)
find quote
wojak Offline
Junior Member
Posts: 10
Joined: Jul 2009
Reputation: 0
Post: #52
hi,
something new about this scraper? It works for you or not?
find quote
plebann Offline
Junior Member
Posts: 2
Joined: Mar 2009
Reputation: 0
Post: #53
Hi.
I have a question:
what about Filmweb scraper in new version? Will be available?
(This post was last modified: 2010-04-13 16:24 by plebann.)
find quote
smuto Offline
Senior Member
Posts: 240
Joined: Sep 2004
Reputation: 2
Post: #54
for now i have no time - maybe i try in summer

from today we have new filmweb portal - my old filmweb scraper is not working anymore

smuto

[Image: 1.png]
find quote
smuto Offline
Senior Member
Posts: 240
Joined: Sep 2004
Reputation: 2
Post: #55
just start working on new filmweb portal - basic scraper is ready for tests

metadata.filmweb.pl.zip

but it seems that "oacute problem" is back, accented characters are show as a entities

smuto

[Image: 1.png]
find quote
spiff Offline
Grumpy Bastard Developer
Posts: 12,172
Joined: Nov 2003
Reputation: 81
Post: #56
you need to specify the fixchars attribute, see http://trac.xbmc.org/changeset/31124

Always read the XBMC online-manual, FAQ and search the forum before posting.
Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules.
For troubleshooting and bug reporting please make sure you read this first.
find quote
smuto Offline
Senior Member
Posts: 240
Joined: Sep 2004
Reputation: 2
Post: #57
filmweb is in utf - so i flag the outputted xml as utf-8

after fixchars="1" there is no "oacute problem", but i have encoding problem - looks like i'm getting utf-8 encoded results which is passed through a utf-8 converter yet again (which doesnt play nice)

smuto

[Image: 1.png]
find quote
spiff Offline
Grumpy Bastard Developer
Posts: 12,172
Joined: Nov 2003
Reputation: 81
Post: #58
oh bugger. yeah, fixchars only works on ascii shit. give me some time to think of a solution, i'll get it in.

Always read the XBMC online-manual, FAQ and search the forum before posting.
Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules.
For troubleshooting and bug reporting please make sure you read this first.
find quote
mako777 Offline
Junior Member
Posts: 5
Joined: Jul 2010
Reputation: 0
Post: #59
Hi,
The new scraper is working pretty well except of two problems/bugs that I have.
First is that in movies list, some movies instead of only title, eg. "Hitman" has something like this "Hitman (2007) - Filmweb".
And Second is that if scraper doesn't find the movie, it's not added to the library and then it's hard to find which movie is not added (I have to search for it manually), this could be easier if such movie will be added to the library with for eg. unknown title
find quote
smuto Offline
Senior Member
Posts: 240
Joined: Sep 2004
Reputation: 2
Post: #60
i just made same fix for weekend tests - please try it

Quote:And Second is that if scraper doesn't find the movie

can u be more accurate? title of the movie or file name?

or just give me an example

smuto

[Image: 1.png]
find quote
Post Reply