Posts: 498
Joined: Jan 2009
Reputation:
2
taxigps
Team-XBMC Python Developer
Posts: 498
I have written two scraper for chinese movie site. Mtime.com use utf-8, so I can matching any word from the web page. But imdb.cn use gb2312, only english character could be matching, some key word in chinese can't be matching. How to resolve this?
Posts: 12,706
Joined: Nov 2003
Reputation:
129
spiff
Team-Kodi Member
Posts: 12,706
set the correct encoding on your scraper
Posts: 498
Joined: Jan 2009
Reputation:
2
taxigps
Team-XBMC Python Developer
Posts: 498
I'll modify imdb.cn scraper to use chinese key word to gether more information. Thanks!
Posts: 12,706
Joined: Nov 2003
Reputation:
129
spiff
Team-Kodi Member
Posts: 12,706
to make sure you got my point;
the scraper itself is an xml file. set its encoding using
?xml version="1.0" encoding="gb2312"?>
Posts: 498
Joined: Jan 2009
Reputation:
2
taxigps
Team-XBMC Python Developer
Posts: 498
Thanks for your help. I can use chinese key word in scraper now. It's works well.