How can I find only Chinese?

Newbie

    Sep 02, 2020#1

    Hello!

    I am using UltraEdit version 24. I want to find only the Chinese characters in a file. I assume this needs a regular expression, but I cannot browse the internet to look one up. Please help.

    Sample:

    DMISMN/'230T PORT5.dmi'
    V(V)=VFORM/ALL
    DISPLY/TERM,V(V),PRINT,V(V),STOR,V(V)
    FILNAM/'230T PORT5.dmi'
    $$
    $$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$
    $$ Create Date :
    $$ UserName(Computer User Account) : 
    $$ FilePath : D:\3 TRY OUT 新程序\DMIS\230T PORT5.dmi
    $$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$
    $$
    UNITS/MM,ANGDEC
    F(LN1)=FEAT/LINE,UNBND,CART,481.79076933,489.83000956,463.46876441,-0.99999988,0.00044643,0.00022010,1.00000000,0.00000000,0.00000000
       CONST/LINE,F(LN1),BF,FA(CIR2),FA(CIR1)
    OUTPUT/FA(LN1),COMT(),DESC( <- CIR2, CIR1的最短线),549869060096,0
    DATDEF/FA(LN1),DAT(A)
    D(NCS2)=DATSET/DAT(A),-XDIR,YORIG,ZORIG
    F(CIR3)=FEAT/CIRCLE,OUTER,CART,350.37442934,-0.21319177,42.59750557,1.00000000,0.00000000,0.00000000,41.96545523
       MEAS/CIRCLE,F(CIR3),3
       SNSET/DEPTH,0.0
          PTMEAS/CART,350.37671119,0.96601861,63.54707170,-0.00000000,0.05619910,0.99841958
          PTMEAS/CART,350.37477017,-21.15344251,43.93195574,-0.00000000,-0.99797563,0.06359755
          PTMEAS/CART,350.37180664,20.70627533,44.22561931,-0.00000000,0.99698511,0.07759305
       SNSET/DEPTH,0.0000
       ENDMES
    OUTPUT/FA(CIR3),COMT(),DESC(),113278976,0
    F(PT1)=FEAT/POINT,CART,0.00000000,0.00000000,0.00000000,1.00000000,0.00000000,0.00000000
    OUTPUT/F(PT1),COMT(),DESC(<- Define),113278976,0
    F(LN2)=FEAT/LINE,UNBND,CART,341.42900137,0.03302641,14.97848503,0.99903909,0.00009664,0.04382783,1.00000000,-0.00000000,-0.00000000
       CONST/LINE,F(LN2),BF,F(PT1),FA(CIR3)
    OUTPUT/FA(LN2),COMT(),DESC( <- PT1, CIR3的最短线),549869060096,0
    Attachment: aaa.txt (26.78 KiB)

    Grand Master

      Sep 02, 2020#2

      Run a Perl regular expression find with the search string [^\x00-\xFF]+ to find one or more characters with a code value NOT in the range 0 to 255 decimal (00 to FF hexadecimal), i.e. everything except ASCII and Latin-1 characters. It is also possible to use [^\x00-\x7F]+ (not in the range 0 to 127 decimal) to exclude only ASCII characters and find all others.
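
For readers who want to try the two character classes outside UltraEdit, here is a minimal sketch in Python. It assumes Python's `re` module, which accepts the same `\xNN` syntax; it is not UltraEdit's Boost engine. The `CJK_ONLY` pattern and the `find_runs` helper are illustrative additions that do not come from the thread. Note that both patterns from the post match any character above the given range, so Korean or Cyrillic text would match too. The narrower `CJK_ONLY` range covers only the CJK Unified Ideographs block (U+4E00 to U+9FFF).

```python
import re

# The two character classes from the post, written for Python's re module.
NON_LATIN1 = re.compile(r"[^\x00-\xFF]+")  # runs of characters above U+00FF
NON_ASCII = re.compile(r"[^\x00-\x7F]+")   # runs of characters above U+007F

# Assumption, not from the thread: match only the CJK Unified Ideographs block.
CJK_ONLY = re.compile(r"[\u4E00-\u9FFF]+")


def find_runs(pattern, text):
    """Return every non-overlapping run in text matched by pattern."""
    return pattern.findall(text)


# Two lines taken from the sample file in the question.
sample_lines = [
    r"$$ FilePath : D:\3 TRY OUT 新程序\DMIS\230T PORT5.dmi",
    "OUTPUT/FA(LN1),COMT(),DESC( <- CIR2, CIR1的最短线),549869060096,0",
]

if __name__ == "__main__":
    for line in sample_lines:
        print(find_runs(NON_LATIN1, line))
```

On the sample lines all three patterns return the same runs, because the only non-ASCII characters there are Chinese.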
      Best regards from an UC/UE/UES for Windows user from Austria

      Newbie

        Sep 03, 2020#3

        Thank you, Mofi.
        I was able to solve the problem with your answer.
        I would like a more detailed explanation.
        Where can I find documentation on \x00, \xFF, etc.?
        I would like to apply this further, for example to find only Korean characters and numbers.

        Best regards.

        Grand Master

          Sep 03, 2020#4

          Open the help of UltraEdit, select the tab Index, enter Perl and double-click on the first found item, Perl Regular Expressions, for basic help on Perl regular expression syntax.

          The regular expression builder in the find/replace window can also be used to build a find/replace regular expression. Enable the option Regular expressions, select the regular expression engine, and then click on the button with the magnifying glass above the find/replace edit box. The list that opens is complete for the legacy regular expression engines UltraEdit and Unix. For Perl it lists just the most often needed expressions, because Perl regular expression syntax is so powerful and complex that it fills entire books and websites.

          The announcement topic Readme for the Find/Replace/Regular Expressions forum contains several links to websites and webpages that explain Perl regular expression syntax, including Boost C++ Libraries - Perl Regular Expression Syntax, which describes the syntax used in UltraEdit.

          See also the Wikipedia articles on Unicode and on Korean language and computers, which list the Unicode code ranges that contain the Korean characters.

          The Perl regular expression search string [\x{1100}-\x{11FF}\x{3130}-\x{318F}\x{A960}-\x{A97F}\x{AC00}-\x{D7A3}\x{D7B0}-\x{D7FF}]+ can be used to find one or more characters from the blocks Hangul Jamo, Hangul Compatibility Jamo, Hangul Jamo Extended-A, Hangul Syllables and Hangul Jamo Extended-B.
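
The same Hangul ranges can be checked outside UltraEdit with a short Python sketch. It assumes Python's `re` module, which does not accept the `\x{1100}` notation, so the code points are written as `\u1100` instead. The `HANGUL_OR_DIGITS` variant is a hypothetical extension for the "Korean characters and numbers" case from the question and does not come from the thread.

```python
import re

# The five Hangul ranges from the post, using \uXXXX instead of \x{XXXX}.
HANGUL_RANGES = (
    r"\u1100-\u11FF"  # Hangul Jamo
    r"\u3130-\u318F"  # Hangul Compatibility Jamo
    r"\uA960-\uA97F"  # Hangul Jamo Extended-A
    r"\uAC00-\uD7A3"  # Hangul Syllables
    r"\uD7B0-\uD7FF"  # Hangul Jamo Extended-B
)

HANGUL = re.compile("[" + HANGUL_RANGES + "]+")

# Assumption, not from the thread: also accept the ASCII digits 0 to 9.
HANGUL_OR_DIGITS = re.compile("[" + HANGUL_RANGES + "0-9]+")

if __name__ == "__main__":
    text = "abc 한글 123 테스트"
    print(HANGUL.findall(text))
    print(HANGUL_OR_DIGITS.findall(text))
```

In UltraEdit itself, the equivalent of the second pattern would presumably be the search string from the post with 0-9 added inside the square brackets.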