Hi,
Sorry for the late reply; we had a wonderful day relaxing with my wife and kids. Still, I've done a little bit of work and have some new questions:
What is the content of the tag <outline>? Something with more info than the tagline, but not as much as plot or plotsummary? I couldn't find it... At the moment I put a "short" plot into this tag (the one shown on the main overview page of IMDB).
What about <certification>? Is it deprecated, with only <mpaa> used instead?
Because of different DVD releases, I've got more than one mpaa tag (it's not a single instance), e.g. 12 years heavily cut, 16 years cut, 18 years uncut for "The Rock" (IMDB-ID = tt0117500).
Is <originaltitle> a subset of <sorttitle>, e.g.
Code:
<originaltitle>
movie A
</originaltitle>
<sorttitle>
movie A has a nice long name
</sorttitle>
<sorttitle>
movie A /* is this included or not? */
</sorttitle>
<sorttitle>
movie A has a short name as well
</sorttitle>
<sorttitle>
movie A has a french name
</sorttitle>
I think this depends on the user's preference...
What about the function GetIMDBThumbs? Does it fetch all pics from IMDB, or only the posters (and maybe product pictures)? What are the constants SX, SY, SX$INFO and SY$INFO (or what are they, if not constants)? Why is the function not repeated (I think users want more than one thumbnail)? I don't know exactly what this function is supposed to do. Should it point to <http://www.imdb.de/title/tt0499549/mediaindex?refine=poster>? Any help?
How can I call a site without the "&" in the URL getting cleaned to "&amp;"? At the moment I use a function which removes the "&amp;" and puts a plain "&" back into the links :=( The "no HTML clean" tag does not work at all...
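To illustrate the kind of workaround meant here, a minimal sketch in Python (not actual scraper code). It assumes the links come back with the ampersand in its HTML-escaped form `&amp;`; the helper name `restore_ampersands` and the `page=2` parameter in the sample link are made up for the example.

```python
def restore_ampersands(url: str) -> str:
    """Replace HTML-escaped ampersands (&amp;) with a plain & so the link can be requested."""
    # Only the ampersand entity is touched; other entities are left alone.
    return url.replace("&amp;", "&")


# Hypothetical cleaned link: the "page=2" parameter is invented for illustration.
cleaned_link = "http://www.imdb.de/title/tt0499549/mediaindex?refine=poster&amp;page=2"
print(restore_ampersands(cleaned_link))
```

This only papers over the problem after the fact; it would still be nicer if the cleaning could be switched off for the URL in the first place.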
ok = my scraper gathers the corresponding info
n/a = not used for import, or no info given on the German IMDB site
stc = still to come => maybe implemented in a future release (meaning: I think it's a useless feature...)
Code:
<movie>
ok <id>tt0432337</id>
ok <title>Who knows</title>
ok <originaltitle>Who knows for real</originaltitle>
ok <sorttitle>Who knows 1</sorttitle>
n/a <set>Who knows trilogy</set>
ok <rating>6.100000</rating>
ok <votes>50</votes>
ok <year>2008</year>
ok <top250>0</top250>
ok <certification>MPAA for different countries</certification>
ok <mpaa>Not available</mpaa>
ok <studio>my camera</studio>
stc <outline>A look at the role of the Buckeye State in the 2004 Presidential Election.</outline>
ok <plot>A look at the role of the Buckeye State in the 2004 Presidential Election.</plot>
n/a <tagline></tagline>
ok <runtime>90 min</runtime>
ok <thumb>http://ia.ec.imdb.com/media/imdb/01/I/25/65/31/10f.jpg</thumb> /* Link broken/not working. Status 500 from server, using thumbs from MovieposterDB */
n/a <playcount>0</playcount>
n/a <watched>false</watched>
n/a <filenameandpath>c:\Dummy_Movie_Files\Movies\...So Goes The Nation.avi</filenameandpath>
stc <trailer></trailer>
ok <genre></genre>
ok <credits></credits>
stc <premiered>single instance/optional</premiered>
n/a <fileinfo>
n/a   <streamdetails>
n/a     <video>
n/a       <codec>h264</codec>
n/a       <aspect>2.35</aspect>
n/a       <width>1920</width>
n/a       <height>816</height>
n/a     </video>
n/a     <audio>
n/a       <codec>ac3</codec>
n/a       <language>eng</language>
n/a       <channels>6</channels>
n/a     </audio>
n/a     <subtitle>
n/a       <language>spa</language>
n/a     </subtitle>
n/a   </streamdetails>
n/a </fileinfo>
ok <director>Adam Del Deo</director>
ok <actor>
ok   <thumb></thumb>
ok   <name></name>
ok   <role></role>
ok </actor>
</movie>
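Note that the listing above is not valid XML as long as the ok / n/a / stc markers are in front of the lines. A minimal Python sketch of how the markers could be split off again, so that the rest parses as a normal document and the status per tag is kept separately. The helper name `split_status` is made up, and the sample is a shortened copy of the listing, not a complete NFO.

```python
import re
import xml.etree.ElementTree as ET

# Status markers used in the listing: ok, n/a, stc (followed by whitespace).
STATUS = re.compile(r"^(ok|n/a|stc)\s+")


def split_status(line):
    """Return (status, xml_part); status is None for lines without a marker."""
    match = STATUS.match(line)
    if match:
        return match.group(1), line[match.end():]
    return None, line


# Shortened copy of the annotated listing.
annotated = """<movie>
ok <id>tt0432337</id>
ok <title>Who knows</title>
stc <trailer></trailer>
n/a <playcount>0</playcount>
</movie>"""

statuses = {}
xml_lines = []
for line in annotated.splitlines():
    status, xml_part = split_status(line)
    xml_lines.append(xml_part)
    tag = re.match(r"\s*<(\w+)>", xml_part)
    if status and tag:
        # Remember which status was noted for this tag name.
        statuses[tag.group(1)] = status

root = ET.fromstring("\n".join(xml_lines))
print(root.findtext("title"))
print(statuses)
```

Tag names that occur more than once (like <codec> under <video> and <audio>) would overwrite each other in the dictionary, so for the full listing the path would have to be tracked as well.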
Actually, most things are working pretty well; only thumbs and pictures are still a bit unclear to me.
What format should <premiered> have? A string with the month written out, or a date?
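To make the two options concrete, a small Python sketch that writes the same date both ways. Which of the two (if either) is expected in <premiered> is exactly the open question, so this is only an illustration; the date itself is made up.

```python
from datetime import date

# English month names spelled out explicitly, so the result does not depend on the locale.
MONTHS = [
    "January", "February", "March", "April", "May", "June",
    "July", "August", "September", "October", "November", "December",
]

premiered = date(2008, 5, 1)  # hypothetical premiere date

# Option 1: plain date in year-month-day form.
as_date = premiered.isoformat()

# Option 2: string with the month written out.
as_text = f"{premiered.day} {MONTHS[premiered.month - 1]} {premiered.year}"

print(f"<premiered>{as_date}</premiered>")
print(f"<premiered>{as_text}</premiered>")
```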
Eisbahn