Filmweb scraper

  Thread Rating:
  • 2 Votes - 3 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Post Reply
spiff Offline
Grumpy Bastard Developer
Posts: 12,180
Joined: Nov 2003
Reputation: 82
Post: #11
please update the scraper to reflect the <nfourl> changes

Always read the XBMC online-manual, FAQ and search the forum before posting.
Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules.
For troubleshooting and bug reporting please make sure you read this first.
find quote
smuto Offline
Senior Member
Posts: 240
Joined: Sep 2004
Reputation: 2
Post: #12
i made update, but no time for tests
filmweb.xml
find quote
smuto Offline
Senior Member
Posts: 240
Joined: Sep 2004
Reputation: 2
Post: #13
scraper in this link filmweb.xml
is only for tests to forum users (i need more testers)

last good working one i will always upload to SVN as a patche

smuto
find quote
spiff Offline
Grumpy Bastard Developer
Posts: 12,180
Joined: Nov 2003
Reputation: 82
Post: #14
oh rite i did commit the last one - will stick to the sf ones from now on

Always read the XBMC online-manual, FAQ and search the forum before posting.
Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules.
For troubleshooting and bug reporting please make sure you read this first.
find quote
dowiew Offline
Junior Member
Posts: 12
Joined: Apr 2007
Reputation: 0
Post: #15
Hi! (siema)
I have many movies and many nfo files associated with them with urls to imdb and filmweb entries inside. Filmweb scraper works well olny for urls starting with http://www.filmweb.pl/Film,id...
Doesn't work for urls like http://frantic.filmweb.pl/ and I have them plenty...
Please update the regexp, probably the matching one in NfoUrl so that filmweb url would be correctly recognized in the nfo...
Regards,
dowiew
find quote
smuto Offline
Senior Member
Posts: 240
Joined: Sep 2004
Reputation: 2
Post: #16
wiesz co nie wiem dlaczego ale nigdy nie zadziałał mi regexp na link tekstowy, a nie mam czasu na testy

filmweb tylko maskuje numer id
twój link
http://frantic.filmweb.pl/

to ten sam co ten z id
http://www.filmweb.pl/Film?id=1107

lub ten
http://www.filmweb.pl/Film,id=1107

sorki ale nie planuję zaktualizować scrapera, mam nadzieję że pliki nfo zaczną zawierać id

smuto

[Image: 1.png]
find quote
smuto Offline
Senior Member
Posts: 240
Joined: Sep 2004
Reputation: 2
Post: #17
i hope i can use my native language in this topic

[Image: 1.png]
find quote
smuto Offline
Senior Member
Posts: 240
Joined: Sep 2004
Reputation: 2
Post: #18
help!!

quite simple. output xml in the format

<actor>
<thumb>...</thumb>
<name>something</name>
<role>somethingelse</role>
</actor>

but, i don't have thumb url in cast. So i try with "url function"

first without luck, but i like this idea (mayby this should work in libary by "Set Actor Thumb"

<actor>
<thumb><url function="ActorLink">...</url></thumb>
<name>something</name>
<role>somethingelse</role>
</actor>

second also without luck
<actor>
<name>something</name>
<role>somethingelse</role>
</actor>
<url function="ActorLink">somethinglink</url>

function="ActorLink"
<actor>
<name>something</name>
<thumb>...</thumb>
</actor>

don't know by mayby i need same numerator

actor$1 -> function="ActorLink$1"
actor$2 -> function="ActorLink$2"

my WIP
filmweb.xml

smuto

[Image: 1.png]
find quote
spiff Offline
Grumpy Bastard Developer
Posts: 12,180
Joined: Nov 2003
Reputation: 82
Post: #19
dont add the actors at that point.

1) make sure all function you call dont clear buffers
2) make sure not to destroy the buffer which holds the id when it enters getdetails (# of htmls +1)
4) grab the url and chain once per actor
5) use the id to grab the role from the filmography list.

that should do, no?

Always read the XBMC online-manual, FAQ and search the forum before posting.
Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules.
For troubleshooting and bug reporting please make sure you read this first.
find quote
smuto Offline
Senior Member
Posts: 240
Joined: Sep 2004
Reputation: 2
Post: #20
i made a lite ver.of scraper for tests
filmweb_only_actor_test.xml

from scrap.exe for "Goodbye Bafana"

details.xml

ActorLink.xml

why in ActorLink.xml i have only one (last) entry , scrap visit all url's from details

[Image: 1.png]
find quote
Post Reply