Tapatalk

UTF-8 without BOM and no CR/LF LF conversions

UTF-8 without BOM and no CR/LF LF conversions

1
NewbieNewbie
1

    Nov 23, 2006#1

    Here is my problem:

    I have to remove the BOM code from UTF-8 text files.
    In UltraEdit I tried doing this by removing the 3 bytes using hex edit mode. So far so good.
    Then I tried saving this file as UTF-8 without BOM.

    The problem now is that the files contain a mix of CR/LF (0d0a) and LF (0a) which have to be kept that way. Saving the file however seems to either convert them to all CR/LF (windows) or all LF (Unix).

    Is there a way to disable this CR/LF LF conversion?

    6,685587
    Grand MasterGrand Master
    6,685587

      Nov 24, 2006#2

      The easiest method to remove the UTF-8 BOM in your special case is to do it without opening the files. This can be done with a normal, non regular expression Replace In Files. You can do this by searching for  (= UTF-8 BOM) and replace it with nothing.

      If you really want to edit files with mixed usage of CRLF and only LF as I was forced also in the past as I needed to edit some Opera M2 mail storage files (*.mbs) you can do this also with UltraEdit. But you have to uncheck everything at Configuration - File Handling - Unicode/UTF-8 Detection and set the config option UNIX/MAC file detection/conversion at DOS/UNIX/MAC Handling to value Disable.
      Best regards from an UC/UE/UES for Windows user from Austria