CEMI's PATSTAT Knowledge Base
Cleaning individual text strings

 

Currently, our method for cleaning individual names consists of the following sequence of steps:

     

  1. Correcting all known corrupted characters (function DECORRUPT)

  2. Removing all leading, trailing, or doubled blank spaces in the individual's name (function DEDOUBLE)

  3. Replacing every accented character with its unaccented equivalent (function NOACCENTS)

  4. Removing all characters not included in the 26-letter Latin alphabet (function JUSTABC)
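
The sequence above can be illustrated with a minimal Python sketch. It is not the code behind DECORRUPT, DEDOUBLE, NOACCENTS and JUSTABC, whose implementations are not shown on this page; the lowercase function names, the `clean_name` wrapper and the small `KNOWN_CORRUPTIONS` table are hypothetical stand-ins. Two further assumptions are made: the last step keeps blank spaces and both letter cases, and accents are removed through Unicode decomposition, so letters without a decomposition (for example "ø" or "ß") are simply dropped by the last step.

```python
import unicodedata

# Hypothetical sample of corrupted sequences (UTF-8 read as Latin-1).
# The actual list of known corruptions is not given on this page.
KNOWN_CORRUPTIONS = {
    "Ã©": "é",
    "Ã¨": "è",
}

ASCII_LETTERS = set("abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ")


def decorrupt(name):
    """Step 1: replace each known corrupted sequence with the intended character."""
    for bad, good in KNOWN_CORRUPTIONS.items():
        name = name.replace(bad, good)
    return name


def dedouble(name):
    """Step 2: drop leading/trailing whitespace and collapse repeated whitespace."""
    return " ".join(name.split())


def noaccents(name):
    """Step 3: decompose accented characters and drop the combining marks."""
    decomposed = unicodedata.normalize("NFKD", name)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def justabc(name):
    """Step 4: keep only the 26 Latin letters (assumption: spaces are kept too)."""
    return "".join(ch for ch in name if ch in ASCII_LETTERS or ch == " ")


def clean_name(name):
    """Apply the four steps in the order listed above."""
    return justabc(noaccents(dedouble(decorrupt(name))))


print(clean_name("  JosÃ©   Dupont-Lefèvre "))  # prints "Jose DupontLefevre"
```

Note that, with the steps in this order, removing a character that stands between two blank spaces in the last step (as in "A & B") leaves a doubled space behind; whether the original functions handle this case is not stated here.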

     

 

 

