![[Image: Image1.jpg]](http://i694.photobucket.com/albums/vv306/Nicezia/Image1.jpg)
As you can see i want the programs to have to do very little in the way of actually processing any scraper information
Nicezia
Fan Joined: Nov 2006 Reputation: 0 Location: Montgomery, Alabama |
2009-05-11 05:49
Post: #31
Here's a idea of how i want the library to interface with programs
![]() As you can see i want the programs to have to do very little in the way of actually processing any scraper information
(This post was last modified: 2009-05-11 06:02 by Nicezia.)
|
| find quote |
Nicezia
Fan Joined: Nov 2006 Reputation: 0 Location: Montgomery, Alabama |
2009-05-11 06:51
Post: #32
Because i want to account for all the behaviours of the XBMC parser...
There's something i've run across with the TVdb scraper which is a RegExp without an input buffer reference, i'm assuming if there is no input reference it automatically assumes buffer $$1? |
| find quote |
nul7
Posting Freak Joined: Oct 2008 Reputation: 14 |
2009-05-11 08:45
Post: #33
there is also an issue that I didn't know how to handle with the allocine scraper:
Code: <RegExp input="$$1" output="<setting label="Activer les Vignettes d'acteurs" type="bool" id="actor" default="[b][color=Red]falsetrue[/color][/b]"></setting>" dest="5+">I just changed the actual XML file to read "true" so it could be parsed correctly. Is this intentional and needs to be coded around? If so, what exactly are we doing here? |
| find quote |
Nicezia
Fan Joined: Nov 2006 Reputation: 0 Location: Montgomery, Alabama |
2009-05-11 08:57
Post: #34
yeah also there's scrapers that use dest="5+" on the first RegExp to be executed, (and for some reason my recursive coding goes over the process twice (and i haven't been able to nail down why, although if you saw in the source there was a place that i had a suspicion was the cause) so with (TVdb scraper for example) it ends up twice as long as it should be.
|
| find quote |
Nicezia
Fan Joined: Nov 2006 Reputation: 0 Location: Montgomery, Alabama |
2009-05-12 13:28
Post: #35
Code: - <details>Okay I'm down to the final little bit of getting my code completely working (all the simpler ones work, its the IMDB one i'm working on now) and this is what i'm coming up with (before processing custom functions... so what exactley is <url cache> for? what does it do or tell XBMC to do?
(This post was last modified: 2009-05-12 13:33 by Nicezia.)
|
| find quote |
sbass
Junior Member Posts: 2 Joined: May 2009 Reputation: 0 |
2009-05-12 14:55
Post: #36
Nicezia,
I've been following the XMLParser work by Spiff followed by the library work by yourself. I'm very interested in seeing something like this implemented for MeediOS in order to avoid issues of scrapers breaking plugins, etc. My question for you is this (and know that I didn't look at the code yet): Is there a mechanism for the parser library to just fetch the top search result without returning a list of selections for the user to choose the best match? The reason I'm asking this is while I personally prefer user initiated matching, some people like to setup importers to auto-import and then manually correct within the UI if necessary. Finally, if we end up determining that some particular fields of interest are missing in an XMLScraper file how are you guys managing modifications or versioning to those scripts? Thanks. Shawn |
| find quote |
Nicezia
Fan Joined: Nov 2006 Reputation: 0 Location: Montgomery, Alabama |
2009-05-12 15:11
Post: #37
Yes I'm working on that as an alternative
I've run into situations where there is an EXACT match only, so i'm trying to code in the ability to return only the First match that is returned I think however that will be set up as a "Return Only Exact Match" setting, naturaully that would b dependant on supplying the right information on a search. The imdb especially is bad about not getting the exact match as the first, so i'm working on a routine that will sort the matches (don't know if they have that in the XBMC scraper system, but i plan to implement it in mine. |
| find quote |
spiff
Grumpy Bastard Developer Joined: Nov 2003 Reputation: 82 |
2009-05-12 15:22
Post: #38
cache is a local filename. we cache the url to that file. it is used for speeding up running several functions on the same page
Always read the XBMC online-manual, FAQ and search the forum before posting. Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules. For troubleshooting and bug reporting please make sure you read this first. |
| find quote |
spiff
Grumpy Bastard Developer Joined: Nov 2003 Reputation: 82 |
2009-05-12 15:41
Post: #39
sbass, that's the beauty of xml. if a field is available, you parse it. if not, you do not. the scrapers have no clue what they are returning as such, to them all is text.
if these guys gets adopted more widely, we should definitely add a version field to them. since it's all xml, again no problem
Always read the XBMC online-manual, FAQ and search the forum before posting. Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules. For troubleshooting and bug reporting please make sure you read this first. |
| find quote |
spiff
Grumpy Bastard Developer Joined: Nov 2003 Reputation: 82 |
2009-05-12 15:46
Post: #40
and yes, we have sorting etc in our system. but that is all external to the scraper stuff
Always read the XBMC online-manual, FAQ and search the forum before posting. Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules. For troubleshooting and bug reporting please make sure you read this first. |
| find quote |