IPB

Welcome Guest ( Log In | Register )

> Notice!

Please take a minute to check our Frequently Asked Questions. Use Search to reveal possible related topics.

Also make sure you've read the Forum Guidelines before posting in this forum.

> Case conversion...
yog-sothoth
post Mar 9 2011, 14:03
Post #1


Member


Group: Full Members
Posts: 64
Joined: 8-November 10
From: UK
Member No.: 13190
Mp3tag Version: 2.48



Howdy. I've stumbled across this Action script that formats all fields to title case. It works very well, except that I would like it to ignore words that are upper-case. For instance, words (or acronyms without periods) like UFO are converted to Ufo. I've left a message for the script's author but have not yet had a reply, so I thought I'd ask the geniuses over here. wink.gif

So, any ideas?

Also, I'm wondering what similar scripts others here are using, so alternative suggestions are welcome. Thanks!

----------------------------------------------------------------------------------
Update: What began as a humble effort to format my mp3 tags to title case has become a bit of an odyssey. With perseverance and a lot of help from the wonderful people that frequent this forum, I have assembled a script that is intended as a comprehensive solution to the problem of standardising, grammatically speaking, all text fields of a digital music collection - including filenames and parent directories.

Presenting...
Grammartron: Smart Case and Grammar Restorer

Features include...*
  • Trim trailing, preceding and extra spaces
  • Enforce correct spacing between words and punctuation
  • Add missing apostrophes to word contractions
  • True title case conversion (upper-case words and letters are preserved)
  • CamelCasing of common Scottish and Irish names
  • Upper-case Roman numerals (up to LXXIX)

Notes
  1. Filenames: Track numbers should ideally be separated from the title by a boundary (e.g "-"). For instance: "%track% - %title%.ext". This avoids (non-critical) problems arising from the title case function. Edit for clarification: articles, conjunctions, etc. (e.g. "the", "and"), are usually lower-case, except if they are the first or last word in the group. In the following example: "01 The End.mp3", "The", although being the first word of the title, is considered as the second word after "01", as all alphanumeric characters are treated equally. Thus we end up with "01 the End.mp3". To avoid this, use a non-space word boundary between the track number and title.
  2. It is advisable to run the "Directory Names" script on its own, in a separate process. The "Tags" and "Filenames" scripts can be used together if desired.
  3. This script adds missing apostrophes to word contractions. Some CD burning software (Nero 9 and earlier versions in particular), are reported to have problems handling apostrophised words and may cause programme instability or aborted burning runs.

Acknowledgements

This script would not have been possible without the help of certain people. Special thanks goes to Liquid Parallax, whose work formed the basis for this script; Doug Mackie, whose awesome script I have shamelessly plagiarised; Dano, author of the core title case action and whose skill with regular expressions is second to none; DetlevD, who helped with Roman numerals; Zoofield, whose script (word contractions) I also ripped off. ph34r.gif Thanks guys, you rock!

*For the complete list of functions, please refer to the descriptions contained within the source code by opening the files in a text editor.

This post has been edited by yog-sothoth: May 15 2011, 08:49
Attached File(s)
Attached File  Grammartron_v1.0.zip ( 8.06K ) Number of downloads: 617
 
Go to the top of the page
 
+Quote Post
6 Pages V   1 2 3 > »   
Start new topic
Replies (1 - 14)
DetlevD
post Mar 9 2011, 14:24
Post #2


Member


Group: Full Members
Posts: 4797
Joined: 26-May 06
From: Wuppertal, Germany, Planet Earth
Member No.: 3194
Mp3tag Version: 2.61d



QUOTE (yog-sothoth @ Mar 9 2011, 14:03) *
... I would like it to ignore words that are upper case. ...

To save already upcased letters from title casing you can use the function $caps2.

DD.20110309.1430.CET


--------------------
* Beyond that, don't ask, when you don't know what to do with the answer. *
♥ home is where the heart is ♥
Go to the top of the page
 
+Quote Post
yog-sothoth
post Mar 9 2011, 14:31
Post #3


Member


Group: Full Members
Posts: 64
Joined: 8-November 10
From: UK
Member No.: 13190
Mp3tag Version: 2.48



QUOTE (DetlevD @ Mar 9 2011, 15:24) *
To save already upcased letters from title casing you can use the function $caps2.

DD.20110309.1430.CET


Oh good, but how do I integrate that into the script? Thanks.
Go to the top of the page
 
+Quote Post
Doug Mackie
post Mar 9 2011, 16:20
Post #4


Member


Group: Full Members
Posts: 35
Joined: 27-April 09
From: New Jersey, USA
Member No.: 9952
Mp3tag Version: 2.51



QUOTE (yog-sothoth @ Mar 9 2011, 08:03) *
... I'm wondering what similar scripts others here are using, so alternative suggestions are welcome.


Here are my two title-case MTA files: Title Case Action Files (Mackie)

They add a series of corrections after running Mp3tag's Mixed Case function. Included are editable lists of abbreviations and acronyms that should be upper-case. If you open the MTA files in Notepad, you will see comments that describe each element in the action.

Note that my tag field actions cover just three fields and are slightly different for each field. That is by design but of course you can edit them as needed.

My regards and thanks to the posters here (including DetlevD) for the examples and explanations that I drew upon when creating these files.

Doug Mackie
Go to the top of the page
 
+Quote Post
yog-sothoth
post Mar 9 2011, 18:57
Post #5


Member


Group: Full Members
Posts: 64
Joined: 8-November 10
From: UK
Member No.: 13190
Mp3tag Version: 2.48



QUOTE (Doug Mackie @ Mar 9 2011, 17:20) *
Here are my two title-case MTA files: Title Case Action Files (Mackie)

They add a series of corrections after running Mp3tag's Mixed Case function. Included are editable lists of abbreviations and acronyms that should be upper-case. If you open the MTA files in Notepad, you will see comments that describe each element in the action.

Note that my tag field actions cover just three fields and are slightly different for each field. That is by design but of course you can edit them as needed.

My regards and thanks to the posters here (including DetlevD) for the examples and explanations that I drew upon when creating these files.

Doug Mackie


Thanks, I'll give it a go. With regards to my other query, would you have an idea of how to achieve this, either using your script or the one I provided? Cheers.
Go to the top of the page
 
+Quote Post
Doug Mackie
post Mar 9 2011, 20:16
Post #6


Member


Group: Full Members
Posts: 35
Joined: 27-April 09
From: New Jersey, USA
Member No.: 9952
Mp3tag Version: 2.51



Yog-Sothoth, you are quite welcome.

Your action starts with the supplied Mixed Case function, as does mine. At that point, your abbreviations have already lost capitalization after the first letter. That is why in my script I added a list of abbreviations to correct the errors. If you were writing a title-case script from scratch (i.e., not using Mixed Case), then $caps2() would be exactly what you want. My guess is that is what DetlevD meant, but of course we should let him speak for himself.

I did not attempt to write my script from scratch because it was much easier to let Florian's built-in Mixed Case function do the heavy-lifting, so to speak, and then to apply corrections afterwards.

Doug Mackie
Go to the top of the page
 
+Quote Post
DetlevD
post Mar 9 2011, 22:21
Post #7


Member


Group: Full Members
Posts: 4797
Joined: 26-May 06
From: Wuppertal, Germany, Planet Earth
Member No.: 3194
Mp3tag Version: 2.61d



QUOTE (Doug Mackie @ Mar 9 2011, 16:20) *
Here are my two title-case MTA files: Title Case Action Files (Mackie) ...

Doug, your collection of 'cleaning and repair tools' looks very impressive, thank you for sharing it!

DD.20110309.2225.CET


--------------------
* Beyond that, don't ask, when you don't know what to do with the answer. *
♥ home is where the heart is ♥
Go to the top of the page
 
+Quote Post
yog-sothoth
post Mar 9 2011, 22:56
Post #8


Member


Group: Full Members
Posts: 64
Joined: 8-November 10
From: UK
Member No.: 13190
Mp3tag Version: 2.48



Cheers Doug, those scripts are very good. Thanks for your time.
Go to the top of the page
 
+Quote Post
yog-sothoth
post Mar 14 2011, 18:21
Post #9


Member


Group: Full Members
Posts: 64
Joined: 8-November 10
From: UK
Member No.: 13190
Mp3tag Version: 2.48



Ok, I've done a little digging and found a solution. All credit to Stevest.

Action: Replace with regular expression
Field: _ALL
Regular expression: ^(.*)$
Replace matches with: $caps2($1)
[ ] case-sensitive comparison

I just replaced the Mixed Case action with this. Works like a charm. laugh.gif

EDIT: THERE IS A PROBLEM WITH THIS SCRIPT. READ ON BEFORE IMPLEMENTING!

This post has been edited by yog-sothoth: Mar 16 2011, 21:19
Go to the top of the page
 
+Quote Post
DetlevD
post Mar 14 2011, 19:07
Post #10


Member


Group: Full Members
Posts: 4797
Joined: 26-May 06
From: Wuppertal, Germany, Planet Earth
Member No.: 3194
Mp3tag Version: 2.61d



QUOTE (yog-sothoth @ Mar 14 2011, 18:21) *
... I just replaced the Mixed Case action with this. Works like a charm. laugh.gif

What I said above in post#2.

DD.20110314.1908.CET


--------------------
* Beyond that, don't ask, when you don't know what to do with the answer. *
♥ home is where the heart is ♥
Go to the top of the page
 
+Quote Post
yog-sothoth
post Mar 14 2011, 19:10
Post #11


Member


Group: Full Members
Posts: 64
Joined: 8-November 10
From: UK
Member No.: 13190
Mp3tag Version: 2.48



QUOTE (DetlevD @ Mar 14 2011, 20:07) *
What I said above in post#2.

DD.20110314.1908.CET


Sorry, co-credit to DetlevD too. cool.gif
Go to the top of the page
 
+Quote Post
DetlevD
post Mar 14 2011, 19:11
Post #12


Member


Group: Full Members
Posts: 4797
Joined: 26-May 06
From: Wuppertal, Germany, Planet Earth
Member No.: 3194
Mp3tag Version: 2.61d



QUOTE (yog-sothoth @ Mar 14 2011, 19:10) *
Sorry, co-credit to DetlevD too. cool.gif

Thanks. walkman.gif

DD.20110314.1912.CET


--------------------
* Beyond that, don't ask, when you don't know what to do with the answer. *
♥ home is where the heart is ♥
Go to the top of the page
 
+Quote Post
yog-sothoth
post Mar 16 2011, 21:17
Post #13


Member


Group: Full Members
Posts: 64
Joined: 8-November 10
From: UK
Member No.: 13190
Mp3tag Version: 2.48



Update: There is a bug in the aforementioned action. If a comma is present anywhere in the field, all characters after the comma are truncated. For example, the title "Harp concerto in B-flat major, op. 4 Nr. 6: 1: Andante-Allegro" becomes "Harp ConCerto in B-Flat MaJor" after conversion. If I remove the comma beforehand, it converts to "Harp Concerto in B-flat Major Op. 4 Nr. 6: 1: Andante-Allegro", which is normal.

Humph, I guess I'm back to square one again. sad.gif Any ideas what is going wrong, anyone?

BTW, this is the offending action, just to avoid confusion:

Action: Replace with regular expression
Field: _ALL
Regular expression: ^(.*)$
Replace matches with: $caps2($1)
[ ] case-sensitive comparison

This post has been edited by yog-sothoth: Mar 16 2011, 21:17
Go to the top of the page
 
+Quote Post
Doug Mackie
post Mar 16 2011, 22:20
Post #14


Member


Group: Full Members
Posts: 35
Joined: 27-April 09
From: New Jersey, USA
Member No.: 9952
Mp3tag Version: 2.51



QUOTE (yog-sothoth @ Mar 16 2011, 16:17) *
Update: There is a bug in the aforementioned action. If a comma is present anywhere in the field, all characters after the comma are truncated.


I can confirm the comma problem but I can't explain it. Perhaps someone else can.

However, that action has other problems. It preserves upper-case errors in the source text, which must then be fixed by hand. More important, unlike "Mixed Case", "Replace with Regular Expression" lacks the option for custom word boundary markers. So you must then add actions to correct the resulting errors after punctuation, brackets, and so on. Perhaps the regular expression could be rewritten to minimize the effects of this limitation, but it would be a challenge. The number of abbreviations and acronyms in my titles is limited, so in my scripts I prefer a "Mixed Case" action followed by lists with corrections. There are no problems with commas.
Go to the top of the page
 
+Quote Post
yog-sothoth
post Mar 16 2011, 22:53
Post #15


Member


Group: Full Members
Posts: 64
Joined: 8-November 10
From: UK
Member No.: 13190
Mp3tag Version: 2.48



QUOTE (Doug Mackie @ Mar 16 2011, 23:20) *
It preserves upper-case errors in the source text, which must then be fixed by hand. More important, unlike "Mixed Case", "Replace with Regular Expression" lacks the option for custom word boundary markers. So you must then add actions to correct the resulting errors after punctuation, brackets, and so on.


Unless I've misinterpreted you, I believe the other scripts in the action group (see first post) should correct these problems. I could be wrong on that though, so please tell me if you think I'm mistaken.

QUOTE (Doug Mackie @ Mar 16 2011, 23:20) *
The number of abbreviations and acronyms in my titles is limited, so in my scripts I prefer a "Mixed Case" action followed by lists with corrections. There are no problems with commas.


Granted, but I'm not so fortunate. I have quite a large collection (around 70k tracks), a great deal of which has upper-case words that would be incorrectly formatted with the mixed case action.
Go to the top of the page
 
+Quote Post

6 Pages V   1 2 3 > » 
Reply to this topicStart new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:

 



RSS Lo-Fi Version Time is now: 26th July 2014 - 18:12