(mouton @ april 04 2005,16:57 Wrote:i work in xml. data-scrapping html file is very ugly and sad...
as i already said in another thread, a way i would be willing to work with would be if someone is willing to host data-scrapping php scripts that would be called by every user of xbmc, and those scripts would do the data-scrapping of the html pages, and produce xml data.
then, when the html pages changes, only the php scripts would have to be changed to produce the same xml.
that would work fine as long as websites don't ban the data scrapper's ip... though i'd guess that would happen often.
maybe we could create a 'pool' of hosts for the data scrapping scripts, and a central gateway could randomize access or something...
it might be a good idea to contact those websites and ask them if they would be willing to allow us to use their data in exchange for their logo on the script or something. but i don't see yahoo accepting such a proposal...
although html scraping is not really a preferred method, a different thinking pattern though: provide the scraping patterns or xmldom processing via one or 2 remote hosts (database filled with patterns and other data that is needed), there are several advantages: everyone can contribute, scraping patterns are centralized and no host will be banned
anyway it is time that streaming feeds become available in huge xml lists :o