You are not logged in.
Pages: 1
Can you add an option to clearly delete all Emoji expressions?
Last edited by jfjj007 (2023-03-03 16:02)
Offline
+1 I really need this feature... maybe add it to CleanUp rule set.
Offline
Here are our findings from an initial investigation into this idea...
The Unicode standard defines Emoji characters as a large set of disconnected ranges of code points. This set continues to grow year after year. Currently, it contains about 3600 code points. The full list of Emojis is also provided in a form of data files.
Implementation options to consider:
1. Offer an option to strip user-defined ranges of code points.
2. Offer an option to strip all Emojis using a built-in dictionary of all Emoji code points, which can be created by parsing the data files.
There is also bit of a confusion regarding which one file or a combination of Emoji data files captures the full range. According to the Unicode Technical Standard #51, the best definition of the full set is in the emoji-test.txt file.
Offline
Not sure if this helps or gives any ideas at all, but this is how yt-dlp strips emojis from filenames when downloading
--replace-in-metadata title "[\U00010000-\U0010ffff]" ""
Maybe having 2 options would be the best solution
- Cleanup Section: Strip All Emojis
and an option to Strip defined emojis or strip all except defined emojis.
I'd be happy with the option to just strip them all.
Thanks for considering and looking into the emoji issue.
Last edited by shuntensatsu (2023-03-15 14:10)
Offline
funny thing, I had a regex issue and asked ChatGPT to help me taking the regex engine of renamer by den4b into consideration and it did actually give me a solution that worked to remove date from second position in a file name.
^(.*?)\s+\d{8}\s+-\s+(.*)
$1 $2
it also gave two regex examples for stripping emojis,
\p{Emoji} or [\u{1F000}-\u{1F9EF}]
but that did not work.
Last edited by shuntensatsu (2023-03-16 12:57)
Offline
This feature was implemented in v7.6.0.2 Beta.
You can find the new option in the Clean Up rule.
Offline
Pages: 1