I have a file encoded as ISO-8859 (according to the file command at the command line). it is the ratings.file from imdb's database, and BBEdit says it's "Western (Mac OS Roman)"

I need the file to be plain ASCII so that I can do grep searches against it via a php script. Here is some sample data:

      0000000123  119567   8.6  LÈon (1994)
0000000124 120390 8.6 Fabuleux destin d'AmÈlie Poulain, Le (2001)
      0000000123   24627   8.5  RashÙmon (1950)
      0000000124   69931   8.4  Vita Ë bella, La (1997)
      0000000123   12564   8.3  Smultronst‰llet (1957)
      0000000114   17411   8.2  8Ω (1963)

I can Zap Gremlins to replace with the code

      0000000123  119567   8.6  L\0xC8on (1994)
0000000124 120390 8.6 Fabuleux destin d'Am\0xC8lie Poulain, Le (2001)

But that doesn't help me in doing a grep search through the file.

I also don't understand why "Smultronstället" shows up as "Smultronst ‰llet" or why 'LÈon' appears instead of 'Léon', etc.

what I want is 'Leon', 'Fabuleux destine d'Amelie Poulain, Le', 'Rashomon', 'Vita e bella, La', and 'Smultronstallet' and '8 1/2'.

And it needs to be fairly quick and easy to fix because I need to update this file every month or two.

And if anyone knows what I am doing: yes, I did try to compile the moviedb-3.24 package under Leopard and failed badly.

--
We will fight for Bovine Freedom and hold our large heads high
We will run free with the Buffalo or die



--
------------------------------------------------------------------
Have a feature request? Not sure the software's working correctly?
If so, please send mail to <[EMAIL PROTECTED]>, not to the list.
List FAQ: <http://www.barebones.com/support/lists/bbedit_talk.shtml>
List archives: <http://www.listsearch.com/BBEditTalk.lasso>
To unsubscribe, send mail to:  <[EMAIL PROTECTED]>

Reply via email to