|Applies To||RSA Product Set: Identity Governance & Lifecycle|
RSA Version/Condition: 7.0.0
User data collected from a .csv file with words having replacement characters, such as ê, display incorrectly in our user interface.
|Cause||When the HXTT driver reads the .csv-file, it uses client's default character set (which is set through Linux environment variables), unless the charSet property is set explicitly to use a particular set. However, Unicode files need to have ByteOrderMark to describe itself to the reader. Most likely, even if the JVM runs in UTF-8 locale, the absence of the ByteOrderMark in the file might make the reader to fall back to ASCII set.|
Essentially, whenever a non-ASCII file needs to read by HXTT driver, the charSet parameter always needs to be supplied in the URL to ensure the proper character set is used for the data.
|Resolution||Modify the collector configuration by adding the charSet=ISO-8859-1 in the connection URL. For example:|