[RELEASE] KinoPoisk2 (Russian Movies) Scraper

  Thread Rating:
  • 0 Votes - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Thread Closed
ababak Offline
Junior Member
Posts: 8
Joined: Apr 2009
Reputation: 0
Location: Kiev, Ukraine
Thumbs Up  [RELEASE] KinoPoisk2 (Russian Movies) Scraper
Post: #1
Hi,

Let me present another KinoPoisk.ru scraper. It's a completely re-worked scraper Kinopoisk.ru with following features:
  • Optimized regexps
  • Low-res cover if no poster present (really helpful on some old movies)
  • Artists' roles
  • Can fetch movie stills fanart, wallpapers fanart, or both
  • Fixed incorrect parsing of outline/plot


Download version 1.0 of KinoPoisk2 from here:
http://files.me.com/andrey_babak/gtxbcl

P.S. I'd like to thank spiff for his help!
(This post was last modified: 2009-04-14 00:57 by ababak.)
find
spiff Offline
Retired Developer
Posts: 12,386
Joined: Nov 2003
Post: #2
awesome for you russians Smile

one question though; what does that ServerEncoding tag do?
find
diemos Offline
Senior Member
Posts: 178
Joined: Feb 2009
Reputation: 0
Post: #3
you are the man! spasibo balshoye. I was waiting for this.

Zemlyak, ya tozhe s Kieva teper v NY.

The Transforminators HD Movie Trailer
- from the creators of Terminator and Transformers -
find
ababak Offline
Junior Member
Posts: 8
Joined: Apr 2009
Reputation: 0
Location: Kiev, Ukraine
Post: #4
spiff Wrote:awesome for you russians Smile

one question though; what does that ServerEncoding tag do?

I didn't check the source of the parser but as far as I can tell looking at the original scraper, it defines how the external URLs are parsed. Maybe it just does nothing though ;-) (or works in Plex only)
find
spiff Offline
Retired Developer
Posts: 12,386
Joined: Nov 2003
Post: #5
i know that it must be a plex thing as i wrote the scraper parser and most of the surrounding code Smile
find
ababak Offline
Junior Member
Posts: 8
Joined: Apr 2009
Reputation: 0
Location: Kiev, Ukraine
Post: #6
By the way, does the parser handle server encoding returned in headers? It would be great to make scraper completely UTF-8
find
spiff Offline
Retired Developer
Posts: 12,386
Joined: Nov 2003
Post: #7
scraper code does honor the encoding you set on the returned xml.

i guess the ServerContentEncoding is used to convert the html pages to utf-8 prior to passing them to the scrapers. i will dig in the plex git

edit: dug a bit. it's nonsense from the plex devs. the servercontentencoding is just a dupe of the encoding set on the returned xml
(This post was last modified: 2009-04-14 00:49 by spiff.)
find
hamp Offline
Member
Posts: 58
Joined: Jul 2008
Reputation: 0
Post: #8
When xbmc load info from the site, Kinopoisk.ru ban me about 30 minutes. Because of what? At the Plex this does not happen.
find
TigerHeart Offline
Junior Member
Posts: 11
Joined: Feb 2009
Reputation: 0
Sad  Scraper doesn't work
Post: #9
I try to get info about the movie Butterfly effect (I type movie name in russian - "Эффект бабочки"). But the scraper returns me next list of movies:
==============
Интервью с вампиром
Сделка с дьяволом
Мадагаскар 2
Ирония судьбы. Продолжение.
Загадочная история Бенджамина Баттона
Суини Тодд,демон+парикмахер с Флит-стрит
==============
And I see the same list every time when I try to get info about any movie. Whats wrong?
Thanks.
PS. I made the screenshots, but I can't understand how to attach them here. But I can send them to anyone by e-mail.
find
GooglieS Offline
Junior Member
Posts: 8
Joined: Oct 2008
Reputation: 0
Post: #10
This script does not load any information/art from kinopoisk! Something is broken?
//не работает! фильм из списка находит, но никакую инфу с кинопоиска не подгружает Sad Что делать?
find
hamp Offline
Member
Posts: 58
Joined: Jul 2008
Reputation: 0
Post: #11
Попытки исправить пока, что нулевые. Вот ждем гуру создателей хбмс. Исправлено только для Plex - ссылка на форум. И очень интересная заметка - бан на самом кинопоиске по ип. И точно так же, как и у TigerHeart.
(This post was last modified: 2009-05-14 20:53 by hamp.)
find
GooglieS Offline
Junior Member
Posts: 8
Joined: Oct 2008
Reputation: 0
Post: #12
Как банит? Меня хттп не банит!
find
TigerHeart Offline
Junior Member
Posts: 11
Joined: Feb 2009
Reputation: 0
Post: #13
Please, return the old version!!! We don't need your version 2!!! Nobody need it. It doesn't work at all!!! Version 1 is the best!!!
find
TigerHeart Offline
Junior Member
Posts: 11
Joined: Feb 2009
Reputation: 0
Post: #14
Eng: Does anybody know where I can download the old wersion of kinopoisk.xml?

Rus: Кто-нибудь знает откуда можно скачать старую версию файла kinopoisk.xml?
find
hamp Offline
Member
Posts: 58
Joined: Jul 2008
Reputation: 0
Post: #15
TigerHeart Wrote:Eng: Does anybody know where I can download the old wersion of kinopoisk.xml?

Rus: Кто-нибудь знает откуда можно скачать старую версию файла kinopoisk.xml?

kinopoisk.xm work fine. But ScraperParser.cpp not work.
Дело не в кинопоиске, а в скрипте, обрабатывающего этот скрапер. Именно в ScraperParser.cpp


Вот его история - http://trac.xbmc.org/log/branches/linuxp...?rev=10815

Вот попробуйте этот - Если работает, то пишите сюда. [ATTACH]86[/ATTACH]
Attached File(s)
.zip  KinoPoiskmakameufrendly.zip (Size: 6.48 KB / Downloads: 71)
(This post was last modified: 2009-05-15 13:15 by hamp.)
find
Thread Closed