No unicode support for filename/filepath?

No unicode support for filename/filepath?

2

    Jul 25, 2006#1

    Hi! I have a text file which contains unicode characters in the filename and filepath. When I tried to open it with Ultraedit 12.10a, it says
    C:\myfiles\???\????.txt contains an invalid path

    Will the next version of UE supports unicode fully, including filename and filepath?

    6,603548
    Grand MasterGrand Master
    6,603548

      Jul 25, 2006#2

      This is a user to user forum. No user can answer this question. Ask IDM support by email.

      My advice: NEVER use Unicode characters in file names and paths and NEVER create directory structures where the name+extension+path is longer than 259 ANSI characters.

      Unicode characters are only supported by NTFS, but not by FAT, FAT32 or some CD-ROM formats. And most programs also not support Unicode names of files/directories. So if you want compatibility with most of the world, don't use Unicode characters in file/directory names.

      See also the NTFS vs FAT comparison sheet.
      Best regards from an UC/UE/UES for Windows user from Austria

      2

        Jul 26, 2006#3

        Thanks for the reply. Unicode is a standard now because of multilingual environment. I have files in both english and chinese filenames. If the filename is in big5/gb characters, then it will show unreadable characters on my English XP OS. So I can only use Unicode.

        I don't think NTFS & FAT32 matters much because my harddisk is FAT32 and MS Notepad opens the text file correctly.

        6,603548
        Grand MasterGrand Master
        6,603548

          Jul 30, 2006#4

          I'm not an expert of MS deep secrets of it's file systems. Maybe you are right and FAT32 now in V1.03 also supports Unicode file names as explained in the DOC file Microsoft Extensible Firmware Initiative FAT32 File System Specification.

          However, I think UltraEdit supports Unicode file names too. I found following line in readme.txt for v11.10a.

          - Multibyte filenames(Korean) can now be opened from Project tree view.

          For a definitive answer on this question you have to ask IDM support by email. Please post the answer also here. I would be also interested in, if UltraEdit supports it or not, although I will have never a file on my disks with a Unicode file name.

          Have you tried it with both settings of the option Check short names against files already loaded to prevent loading of two copies of same file at Configuration - File Handling - Advanced?

          You can also use Filemon to look what's going on on your harddisk when UltraEdit tries to open the file with the Unicode characters. That's what I always do when access to files fails and I want to know why.
          Best regards from an UC/UE/UES for Windows user from Austria

          1
          NewbieNewbie
          1

            Mar 22, 2007#5

            There is a difference between multi-byte characters sets and unicode. The difference is subtle, but when it comes to implementing a program it will affect which system libraries are used.

            I have recently downloaded UltraEdit v13.00 and confirm that it will not open files which have file names that contain non-Latin characters. The explorer pane in UE also displays the filename incorrectly.

            2
            NewbieNewbie
            2

              Dec 07, 2009#6

              Still not fixed at v15.20.

              Notepad solved this problem about 10 years ago. As I pointed out to their tech support all they need to do is switch to a Unicode file open dialog and there would be no problem.

              I should just switch to another notepad replacement program but after being a registered user for 11 years I have gotten used to the facilities of UltraEdit.

              6,603548
              Grand MasterGrand Master
              6,603548

                Dec 07, 2009#7

                takarada wrote:Notepad solved this problem about 10 years ago. As I pointed out to their tech support all they need to do is switch to a Unicode file open dialog and there would be no problem.
                Notepad can't be used as reference here because with Notepad you can just open a text file, edit it, print content and save it and nothing more.

                And it is much more complicated then just using the Unicode version of the file open dialog. You have to think about all situations where the name of the file is used in UltraEdit: document array for scripts and other scripting commands; IfNameIs, IfExtIs, CopyFilePath in macros; File Names = and File Extensions = in wordfiles; file names in the output window, in the title bar, in the file tree view, find / replaces in files, projects, workspaces, FTP, Telnet, and so on. All type of configuration files (INI, PRJ, wordfiles, etc.) which are currently only in ASCII/ANSI must be changed to Unicode (for best downwards compatibility UTF-8) to support storing Unicode file names (histories, tools, project files, etc.). While ASCII/ANSI file names are limited in length to 259 characters, the limit for Unicode file names is 32767 characters. Have you ever seen a file not found error message or a file list when the full path of a file is really long, for example 1000+ characters. Many, many things must be changed to get full Unicode support for file names.

                The author of my favorite file manager Total Commander spent 1 year with more than 6 months public beta testing to get TC 99.5% Unicode file name compatible.

                The big advantage the IDM developers would have in comparison to the author of Total Commander: UltraEdit is only compatible with Win2K and later while TC is compatible with Win95 and any later version of Windows. I'm a programmer which has developed a Windows program which is compatible from Windows 98 to Windows 7 without Unicode support and with using very limited Windows functions and therefore know how much work it is to get a Windows application working on all versions of Windows without limitations, especially when the application has a history and is not written completely new. In my point of view the author of Total Commander is really a genius.

                A workaround, which TC used before v7.50 was released with full Unicode file name support, is to open files with Unicode characters with 8.3 short name when it is possible to get the short file name of a file with a Unicode character in path/name. But users often do not like it when they see their files with "encrypted" name in an application because the application used the 8.3 format of the file name.
                Best regards from an UC/UE/UES for Windows user from Austria

                2
                NewbieNewbie
                2

                  Dec 09, 2009#8

                  I think they have to do it. Right now most people with Unicode filenames would just give up on Ultraedit and go elsewhere.
                  There are several editors available that are Unicode compliant. I am just too used to Ultraedit to make changing easy.

                  But your suggestion to use the short name is useful. I can get dopus to pass the short path and file name to ultraedit.