Yet another scraping issue..
#1
Scraping movie information doesn't work, and I cannot find out why.

Debug log is here:
http://pastebin.com/nhacjmTb

I suppose this message is bad:
XFILE::CFileCurl::CReadState::FillBuffer: curl failed with code 22
(Look near the end of the log file)

My internet connection is fine, and the moviedb website is running...

Any help to a new XBMC user would be appreciated. Thanks!
#2
I guess I did not search the forum thoroughly enough. I found this topic:
http://forum.xbmc.org/showthread.php?tid=113687

I will try to uninstall the YouTube add-on when I get home and see what happens...
#3
That does look odd. You can tell what curl is trying to open by looking at the log. In this case the key line is:

Code:
DEBUG: FileCurl::Open(0882EE00) http://api.themoviedb.org/2.1/Movie.search/nl/xml/57983e31fb435df4df77afb854740ea9/avpr%2d%20aliens%20vs%20predator%20%2d%20requiem+2007

You can check the request by running curl manually; you'll need to download curl from http://curl.haxx.se/ first. Open a command prompt and type:

Code:
curl "http://api.themoviedb.org/2.1/Movie.search/nl/xml/57983e31fb435df4df77afb854740ea9/avpr%2d%20aliens%20vs%20predator%20%2d%20requiem+2007"

The argument for curl is the URL from the debug log (remember the quotes around the URL). Press return and this will send the request. On my PC I get:

Code:
<?xml version="1.0" encoding="UTF-8"?>
<OpenSearchDescription xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/">
  <opensearch:Query searchTerms="avpr- aliens vs predator - requiem 2007"/>
  <opensearch:totalResults>0</opensearch:totalResults>
  <movies>Nothing found.</movies>
</OpenSearchDescription>

So the request is working even though it doesn't find anything.
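
If the log is long, the opened URLs can also be pulled out with a short script. This is only a rough Python sketch: the regular expression assumes every request is logged in the same "FileCurl::Open(address) URL" form as the line quoted above, and the names requested_urls and search_term are made up for this example.

```python
import re
from urllib.parse import unquote_plus

# Assumed log format, based on the single line quoted above:
#   DEBUG: FileCurl::Open(0882EE00) http://...
OPEN_RE = re.compile(r"FileCurl::Open\([0-9A-Fa-f]+\)\s+(\S+)")


def requested_urls(log_text):
    """Return every URL that appears in a FileCurl::Open debug line."""
    return OPEN_RE.findall(log_text)


def search_term(url):
    """Decode the last path segment of a Movie.search URL into readable text."""
    return unquote_plus(url.rsplit("/", 1)[-1])


sample = (
    "DEBUG: FileCurl::Open(0882EE00) "
    "http://api.themoviedb.org/2.1/Movie.search/nl/xml/"
    "57983e31fb435df4df77afb854740ea9/"
    "avpr%2d%20aliens%20vs%20predator%20%2d%20requiem+2007"
)

for url in requested_urls(sample):
    # Shows what the scraper actually searched for.
    print(search_term(url))
```

The decoded text should match the searchTerms attribute in the XML response, which makes it easy to see what title the scraper derived from the file name.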

JR
#4
When I simply enter the URL in my browser I get the same XML info.

I found this explanation of the error code on the curl website; maybe it helps:

CURLE_HTTP_RETURNED_ERROR (22)
This is returned if CURLOPT_FAILONERROR is set TRUE and the HTTP server returns an error code that is >= 400.

Unfortunately curl does not report the exact HTTP status in this message, so it could be 401 Unauthorized, 403 Forbidden, 404 Not Found, or any other code of 400 or above.
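
One way to see the actual status code is to make the same request from a small script. The following is a minimal Python sketch, not anything XBMC itself does: http_status and explain are hypothetical helper names, and the "code 22" flag simply applies the ">= 400" rule from the curl description quoted above.

```python
from http import HTTPStatus
import urllib.error
import urllib.request


def explain(code):
    """Return (reason phrase, True if curl's fail-on-error mode would report error 22)."""
    try:
        phrase = HTTPStatus(code).phrase
    except ValueError:
        # Not a status code known to the standard library.
        phrase = "Unknown"
    return phrase, code >= 400


def http_status(url, timeout=10):
    """Return the HTTP status code the server sends for url."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # urllib raises for 4xx/5xx responses; the code is kept on the exception.
        return err.code


# Example (needs network access, so it is left commented out):
# code = http_status("http://api.themoviedb.org/...")
# print(code, *explain(code))
```

Calling http_status with the URL taken from the debug log would show whether the server answers with 401, 403, 404 or something else.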

One more thing: I get the same response ("unable to connect to remote server Would you like to continue scanning") for all my movies, so not just this one.

Thanks!
#5
See if uninstalling and reinstalling the YouTube add-on does the trick.

JR
#6
So I uninstalled the YouTube add-on yesterday evening.

Unfortunately I still get the same popup message when I start XBMC:
"unable to connect to remote server Would you like to continue scanning"

So I looked at the debug log, and it looks quite different this time. The "curl failed with code 22" message is gone now...

Could it be that this line is causing the problem this time:
"ADDON: cpluff: 'Could not return information about unknown plug-in plugin.video.youtube.'"

Debug log is here:
http://pastebin.com/B5hSs9Pi

Thanks again!
#7
stefanroelofs Wrote: So I uninstalled the YouTube add-on yesterday evening.

Did you reinstall the YouTube add-on again?

The YouTube add-on must be installed and working for the scraping to work. If the add-on has become broken for any reason, uninstalling and reinstalling it will fix it.

JR
#8
I've been trying to understand this problem too, as I can't reproduce it when I run curl on the command line.

I did a network capture of the HTTP stream used by XBMC.

It ends like this:

GET /2.1/Movie.search/fr/xml/57983e31fb435df4df77afb854740ea9/airplane%202%2dreenc HTTP/1.1
User-Agent: XBMC/PRE-11.0 Git:20111003-879be40 (Linux; Arch Linux; 3.1.0-4-ARCH x86_64; http://www.xbmc.org)
Host: api.themoviedb.org
Accept: */*
Connection: keep-alive

HTTP/1.1 503 Service Unavailable
Server: nginx
Date: Sun, 13 Nov 2011 18:28:38 GMT
Content-Type: text/html; charset=utf-8
Connection: keep-alive
Keep-Alive: timeout=20
Retry-After: 0
Content-Length: 418
X-Varnish: 765874750
Age: 0
Via: 1.1 varnish
X-Cache: MISS


<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html>
<head>
<title>503 Service Unavailable</title>
</head>
<body>
<h1>Error 503 Service Unavailable</h1>
<p>Service Unavailable</p>
<h3>Guru Meditation:</h3>
<p>XID: 765874750</p>
<hr>
<p>Varnish cache server</p>
</body>
</html>

So it seems that the problem comes from themoviedb, not XBMC. A retry in XBMC would be a nice workaround, though.
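
To illustrate the kind of retry I mean, here is a rough Python sketch. It is not XBMC code (XBMC does not work this way as far as I know), and fetch_with_retry and its fetch argument are hypothetical: fetch is assumed to be any callable that returns a (status, headers, body) tuple. It retries only on 503 and waits for the number of seconds given in Retry-After when that header is numeric, as in the capture above.

```python
import time


def fetch_with_retry(fetch, url, attempts=3, default_delay=1.0, sleep=time.sleep):
    """Call fetch(url) until the status is not 503 or the attempts run out.

    fetch is a hypothetical callable returning (status, headers, body),
    where headers is a dict. Returns the last (status, body) seen.
    """
    attempts = max(1, attempts)
    for attempt in range(1, attempts + 1):
        status, headers, body = fetch(url)
        if status != 503 or attempt == attempts:
            return status, body
        # Use Retry-After when it is a number of seconds; fall back to the
        # default when it is missing or given as an HTTP date.
        try:
            delay = float(headers.get("Retry-After", default_delay))
        except (TypeError, ValueError):
            delay = default_delay
        sleep(max(delay, 0.0))
```

With "Retry-After: 0", as the server sent here, the second attempt would go out immediately, which is probably all that is needed when the 503 is only a brief hiccup in the cache server.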