Kodi Community Forum
Why does TMM removes characters before scraping in the title? - Printable Version

+- Kodi Community Forum (https://forum.kodi.tv)
+-- Forum: Support (https://forum.kodi.tv/forumdisplay.php?fid=33)
+--- Forum: Supplementary Tools for Kodi (https://forum.kodi.tv/forumdisplay.php?fid=116)
+---- Forum: TinyMediaManager (https://forum.kodi.tv/forumdisplay.php?fid=204)
+---- Thread: Why does TMM removes characters before scraping in the title? (/showthread.php?tid=359398)



Why does TMM removes characters before scraping in the title? - Silent Strider - 2020-12-27

Just curious.

Example: all ASCII characters (except those, Windows doesn't allow as filename like " or ?)
Code:
!#$%&'()+,-.0123456789;=@ABCDEFGHIJKLMNOPQRSTUVWXYZ[]^_`abcdefghijklmnopqrstuvwxyz{}~

when loaded into TMM:
Code:
!#$%&' + 0123456789;=@abcdefghijklmnopqrstuvwxyz^ `abcdefghijklmnopqrstuvwxyz ~
so all brackets get removed or replaced with blanks, no matter which
, - . are all removed
_ is converted to blanks

Why is it doing so? Couldn't/Shouldn't it leave the title as it is before it gets eventually renamed after scraping? Or at least give the user the option to let TMM 'clean' the filename? (not really a cleaning on movies like "Mr. & Mrs. Smith" which becomes "Mr & Mrs Smith" or the German title of "The 40 Year-Old Virgin" which would be "Jungfrau (40), männlich, sucht…" and becomes "Jungfrau 40 männlich sucht")


RE: Why does TMM removes characters before scraping in the title? - mlaggner - 2020-12-28

because you never know how messy those file names are if tmm picks that up from the storage without any NFO. Just looked up your example how scene releases might look like:

Code:

The.40.Year.Old.Virgin.2005.THEATRICAL.720p.WEB.H264-FLAME
Jungfrau.40.maennlich.sucht.German.2005.AC3.DVDRip.x264.iNTERNAL-VhV
Mr.and.Mrs.Smith.German.2005.DL.PAL.DVDR.iNTERNAL-CiA

the first thing tmm needs to do is to remove _unnecessary_ data from the filename (badwords, stopwords). As you see in the examples above, the file names are pretty "clean" (only separated by a dot) - but tmm needs to pick up everything, so we clean as much as possible. And after the scrape the users will have a clean title in the NFO where file names do not matter


RE: Why does TMM removes characters before scraping in the title? - Silent Strider - 2020-12-29

Thank you for the clarification.

I would die of OCD if I had to work with such filenames to be honest. They seem like the default names for downloaded movies from torrent or alike. To each his own Smile

Case closed!