XBMC Community Forum
ScraperXML (Open Source XML Web Scraper C# Library) please help verify my work...




ScraperXML (Open Source XML Web Scraper C# Library) please help verify my work... - Nicezia - 2009-04-30 20:44

Just to make sure I'm getting this right (don't blast me, I am just trying to verify my work):

Code:
scraper $$(20)  // array of 20 strings
|
|
Function  // 9 string fields (the info is compiled to XML format and sent
     |    // back as a single string to one of the 20 buffers)
     |
     |
     Regular Expression  // sends the info back to the function arrays
         |
         |
         Expression  // makes matches for each field and sends each field
                     // 1-9 back to the RegExp as an array

Am I understanding this right? From the expression we get a results-determined number of 9-string arrays (var[?][8]), which the RegExp compresses into a single string array (var[8]). That is sent to the function in one of 9 possible string variables (var = single string), collected from each RegExp. The function then compresses these 9 fields into a single string, which is sent to one of the twenty scraper buffers, and at the end of a function clearbuffers (if set) clears the 9 function fields?
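
For readers following along, the flow asked about above can be sketched roughly as below. This is only an illustration of the model as described in this post, not XBMC's or ScraperXML's actual code. The names `run_regexp` and `run_function`, the dictionary of buffers, and the exact clearing behaviour are all assumptions made for the example.

```python
import re

NUM_BUFFERS = 20  # scraper-level string buffers, numbered 1..20
MAX_FIELDS = 9    # fields per match, referenced as \1..\9 in the output


def run_regexp(expression, output, source):
    """Fill the output template once per match and join the pieces."""
    pieces = []
    for match in re.finditer(expression, source):
        def field(ref, match=match):
            index = int(ref.group(1))
            # Fields the expression does not capture become empty strings.
            if index > match.re.groups:
                return ""
            return match.group(index) or ""
        pieces.append(re.sub(r"\\([1-9])", field, output))
    return "".join(pieces)


def run_function(buffers, expression, output, source, dest, clear_buffers=True):
    """Run one RegExp on buffer `source`, store the string in buffer `dest`."""
    result = run_regexp(expression, output, buffers[source])
    buffers[dest] = result
    if clear_buffers:
        # Assumption: clearing empties every buffer once the result is returned.
        for index in range(1, NUM_BUFFERS + 1):
            buffers[index] = ""
    return result
```

With a page in buffer 1 and an expression that captures two fields, `run_function` would return one XML-like string built from every match, which is the "compiled to a single string" step in the diagram.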


- Nicezia - 2009-05-01 05:27

Never mind, I just figured the whole thing out. Seems I was thinking about it in the wrong way.

However, does the override option make the regexp ignore culture-specifics?


- spiff - 2009-05-02 01:37

i do not understand what you mean.


- Nicezia - 2009-05-02 22:16

I suppose it would be easier just to ask what the override option does, because I haven't got a clue. I'm guessing it has something to do with the regular expression engine, but I'm not quite clear on what it sets the engine to do.


- spiff - 2009-05-03 00:15

there is no override option?


- Nicezia - 2009-05-03 13:36

From the Scraper.xml wiki:

Quote: conditional="<condition>": A condition that must resolve to TRUE for the particular RegExp to be run. Currently the only available condition is "override", which is set based on the Language Override setting in the scraper.

Can you point me to the code that handles this function in XBMC?
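
As a rough illustration of the quoted wiki text (not the XBMC code being asked for), a parser might gate each RegExp on a named setting like this. The function name `should_run` and the use of a plain dictionary of booleans for the scraper settings are assumptions for the sketch.

```python
def should_run(conditional, settings):
    """Decide whether a RegExp with this conditional attribute should run."""
    if not conditional:
        # No conditional attribute: the RegExp always runs.
        return True
    # Assumption: a setting that is missing counts as false.
    return bool(settings.get(conditional, False))
```

For example, `should_run("override", {"override": True})` would let the RegExp run, while the same call with the setting switched off would skip it.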


- spiff - 2009-05-03 13:54

aha.

those are scraper settings. they are the stuff returned from the <GetSettings> scraper function.

see PluginSettings.cpp (CBasicSettings) and ScraperSettings.cpp.

they are also used with the $INFO[settingname] construct
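
A minimal sketch of how a parser might expand the $INFO[settingname] construct, assuming the settings are available as a simple name-to-value dictionary. `expand_info` is a hypothetical helper, not a function from XBMC, and replacing unknown names with an empty string is an assumption.

```python
import re

# Matches $INFO[name] and captures the setting name between the brackets.
INFO_PATTERN = re.compile(r"\$INFO\[([^\]]+)\]")


def expand_info(text, settings):
    """Replace every $INFO[name] in text with the value of that setting."""
    def lookup(match):
        # Assumption: unknown setting names expand to an empty string.
        return str(settings.get(match.group(1), ""))
    return INFO_PATTERN.sub(lookup, text)
```

Under these assumptions, a URL template containing `$INFO[language]` would come back with the user's language setting filled in.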


- Nicezia - 2009-05-03 14:21

oh ok, perhaps that's why it didn't make sense to me, i haven't tackled the whole custom function thing yet.


What's the protocol on get settings - Nicezia - 2009-05-05 00:58

btw, I have it running most scrapers. I ported it to MonoDevelop and compiled it, and it works the same on both Mono and .NET. Currently it's console only (I separated it from the UI because the UI was kinda distracting me from coding the damn thing: I'd code a bit, and then slip over to the UI to consider how to integrate that code into it).

I had asked a question, but once I looked into plugin settings that was answered for me :)


- spiff - 2009-05-05 13:22

great! :)

i would put a lot of effort into having your parser work as a library if i were you. it will make it a lot more useful; in particular, i sincerely hope that stuff like MIP will pick it up.