(2013-01-26 17:21)Zazza Wrote:
(2013-01-24 13:50)gborri Wrote:
Thanks Zazza, your suggestion has resolved my problem.
I have a question for WG++Maker.
I would like to add episode info to the ini file for guidatv.sky.it. My problem is that I would like to be able to use either xmltv_ns or onscreen.
How can I make that happen?
thanks
Giovanni
To get episode/season numbers use this guidatv.sky.it.ini file I've modified.
Code:
**------------------------------------------------------------------------------------------------
* @header_start
* WebGrab+Plus ini for grabbing EPG data from TvGuide websites
* @Site: guidatv.sky.it
* @MinSWversion: V0
* none
* @Revision 2 - [30/08/2011] Willy De Wilde/Jan van Straaten
* added credits/category and production date
* @TESTING - [26/01/2013] Zazza
* added episode-num in xmltv_ns format
* @Remarks:
* none
* @header_end
**------------------------------------------------------------------------------------------------
site {url=guidatv.sky.it|timezone=UTC+01:00|maxdays=7.1|cultureinfo=it-IT|charset=UTF-8|titlematchfactor=90|episodesystem=xmltv_ns}
url_index{url|http://guidatv.sky.it/guidatv/canale/|channel|.shtml}
urldate.format {daycounter|0}
*
index_urlshow {url ()||<a href="||">|</li>}
*
index_showsplit.scrub {multi ()|<p class="ora">|||<li class="dispari">}
index_date.scrub {single(force)|<p class="giorno">||h.|<p class="tools">}
index_start.scrub {single|||</p>}
index_title.scrub {single(separator=" - " include=first)|<strong>||</strong>|</li>}
*
* enable the next two lines to create a channel list
*index_site_channel.scrub {multi|<ul id="clup-menu-bar"|class="">|</a>|</ul><!-- end clup-menu-bar -->}
*index_site_id.scrub {multi|<ul id="clup-menu-bar"|weekChannel=|" class=|"</ul><!-- end clup-menu-bar -->}
*
title.scrub {single(separator=" - " include=first)|<title>|||</title>}
subtitle.scrub {multi(separator=" - " exclude=first)|<title>||</title>}
temp_1.scrub {single(exclude="protagonista")|<div class="__description">||</div>|</div>}
temp_2.scrub {multi|<div class="__description">||</div>|</div>}
temp_3.scrub {multi|<div class="testo">||</div>|</div>}
* find season
temp_4.scrub {multi(separator=" " include=first)|<div class="__pilat">||</div>|</div>}
temp_4.modify {remove('temp_4' not "")|\'}
temp_4.modify {calculate(not "" format=F0)|'temp_4' 1 -}
* find episode
temp_5.scrub {single|<div class="__pilat">| Stagione Ep.| -}
temp_5.modify {calculate(format=F0)|'temp_5' 1 -}
episode.modify {addend('temp_4' = "")|...}
episode.modify {addend('temp_4' not "")|'temp_4'}
episode.modify {addend('temp_4' not "")|.'temp_5'.}
description.scrub {single|__pilat">||</div>|</div>}
category.scrub {single|<h5>Informazioni</h5>|<strong>Genere</strong>: |<br />|<br />}
rating.scrub {single|<h5>Informazioni</h5>|<img style="display:inline"|</p>|</p>}
director.scrub {single(separator=", con " include=first)|<div class="testo">|Regia di |; |</div>}
actor.scrub {single(separator=", con " exclude=first)|<div class="testo">|Regia di |; |</div>}
*
*
* operations:
subtitle.modify {remove|Sky.it}
description.modify {addend(null)|'temp_1'}
description.modify {addend(null)|'temp_2'}
description.modify {addend(null)|'temp_3'}
description.modify {remove|<span style="font-weight: bold;">}
description.modify {remove|<span style="font-style: italic;">}
description.modify {remove|<font face="Arial">}
description.modify {remove|<span style="FONT-WEIGHT: bold; FONT-STYLE: italic">}
description.modify {remove|<span style="FONT-STYLE: italic">}
description.modify {remove|<span style="FONT-WEIGHT: bold">}
description.modify {cleanup}
rating.modify {replace(~ "per tutti")|'rating'|per tutti}
rating.modify {replace(~ "bambini accompagnati")|'rating'|bambini accompagnati}
rating.modify {replace(~ "V.M. 12")|'rating'|12+}
rating.modify {replace(~ "V.M. 14")|'rating'|14+}
productiondate.modify {calculate(format=F0)|'description' 1 *}
productiondate.modify {remove(0)|'productiondate'}
description.modify {remove|Regia di 'director', }
description.modify {remove|con 'actor'; }
actor.modify {replace|,|\|}
Please test it and let me know if it works inside a frontend. Keep in mind that the HTML source page does not always show season and episode numbers. In that case the episode-num element is filled with the string "...".
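If you want to check the grabbed values by hand, here is a rough Python sketch of what the temp_4/temp_5 lines in the ini are meant to produce. It is only an illustration: the function name episode_num is made up and is not part of WebGrab+Plus, and I am assuming the site shows season and episode counted from 1, which is why the ini subtracts 1 from each (xmltv_ns counts from 0).

```python
from typing import Optional


def episode_num(season: Optional[int], episode: Optional[int]) -> str:
    """Hypothetical helper mirroring the episode.modify lines of the ini."""
    # No season on the page: the ini falls back to the literal string "..."
    if season is None:
        return "..."
    # xmltv_ns is zero-based, so the 1-based numbers are reduced by one
    episode_part = "" if episode is None else str(episode - 1)
    # Layout is season.episode.part; the part field is left empty
    return f"{season - 1}.{episode_part}."


if __name__ == "__main__":
    print(episode_num(2, 5))
    print(episode_num(None, None))
```

So a page showing season 2, episode 5 should end up as 1.4. in the episode-num element, assuming the scrub picks up both numbers.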
Thank you.
Thanks Zazza,
In the meantime I tried a solution of my own and it seems to work. I'll look at your code and let you know if I have done anything differently.
I know that episode/season info is not always there, but if WebGrab+Plus is fast enough to grab incrementally, I hope to have more chances to get it.
I have a problem (I think you do too): the Italian letters à, ò, ì, etc. are not recognized correctly. Have you found a solution?
thanks
Giovanni

