I have a file encoded as ISO-8859 (according to the file command at
the command line). it is the ratings.file from imdb's database, and
BBEdit says it's "Western (Mac OS Roman)"
I need the file to be plain ASCII so that I can do grep searches
against it via a php script. Here is some sample data:
0000000123 119567 8.6 LÈon (1994)
0000000124 120390 8.6 Fabuleux destin d'AmÈlie Poulain, Le
(2001)
0000000123 24627 8.5 RashÙmon (1950)
0000000124 69931 8.4 Vita Ë bella, La (1997)
0000000123 12564 8.3 Smultronst‰llet (1957)
0000000114 17411 8.2 8Ω (1963)
I can Zap Gremlins to replace with the code
0000000123 119567 8.6 L\0xC8on (1994)
0000000124 120390 8.6 Fabuleux destin d'Am\0xC8lie Poulain,
Le (2001)
But that doesn't help me in doing a grep search through the file.
I also don't understand why "Smultronstället" shows up as "Smultronst
‰llet" or why 'LÈon' appears instead of 'Léon', etc.
what I want is 'Leon', 'Fabuleux destine d'Amelie Poulain, Le',
'Rashomon', 'Vita e bella, La', and 'Smultronstallet' and '8 1/2'.
And it needs to be fairly quick and easy to fix because I need to
update this file every month or two.
And if anyone knows what I am doing: yes, I did try to compile the
moviedb-3.24 package under Leopard and failed badly.
--
We will fight for Bovine Freedom and hold our large heads high
We will run free with the Buffalo or die
--
------------------------------------------------------------------
Have a feature request? Not sure the software's working correctly?
If so, please send mail to <[EMAIL PROTECTED]>, not to the list.
List FAQ: <http://www.barebones.com/support/lists/bbedit_talk.shtml>
List archives: <http://www.listsearch.com/BBEditTalk.lasso>
To unsubscribe, send mail to: <[EMAIL PROTECTED]>