Tapatalk

UTF-8 without BOM and no CR/LF LF conversions

UTF-8 without BOM and no CR/LF LF conversions

1
NewbieNewbie
1

PostNov 23, 2006#1

Here is my problem:

I have to remove the BOM code from UTF-8 text files.
In UltraEdit I tried doing this by removing the 3 bytes using hex edit mode. So far so good.
Then I tried saving this file as UTF-8 without BOM.

The problem now is that the files contain a mix of CR/LF (0d0a) and LF (0a) which have to be kept that way. Saving the file however seems to either convert them to all CR/LF (windows) or all LF (Unix).

Is there a way to disable this CR/LF LF conversion?

6,825625
Grand MasterGrand Master
6,825625

PostNov 24, 2006#2

The easiest method to remove the UTF-8 BOM in your special case is to do it without opening the files. This can be done with a normal, non regular expression Replace In Files. You can do this by searching for  (= UTF-8 BOM) and replace it with nothing.

If you really want to edit files with mixed usage of CRLF and only LF as I was forced also in the past as I needed to edit some Opera M2 mail storage files (*.mbs) you can do this also with UltraEdit. But you have to uncheck everything at Configuration - File Handling - Unicode/UTF-8 Detection and set the config option UNIX/MAC file detection/conversion at DOS/UNIX/MAC Handling to value Disable.