STechno

Note 788449 - Byte-order Marks in UTF-8 Files

Header
Version / Date 2 / 2005-02-28
Priority Recommendations/additional info
Category Program error
Primary Component BC-ABA-LA Syntax, Compiler, Runtime
Secondary Components

Summary
Symptom
    1. It is not detected that a file is encoded in UTF-8.
    2. If a UTF-8 file has a byte-order mark at the beginning, the first READ DATASET statement reads the byte-order mark into the memory, or, if the system is a non-Uniocde system, reads a '#' into the memory.
Other terms

CL_ABAP_FILE_UTILITIES, CHECK_FOR_BOM

Reason and Prerequisites

A byte-order mark can be used to indicate that a file is encoded in UTF-8 or UTF-16. For UTF-8, the byte-order mark is the byte sequence EF BB BF. The byte-order mark should not be read into the memory.

Solution
https://service.sap.com/sap/support/notes/788449 (SAP Service marketplace login required)

Key words : basis support, support package, method check, became available, dataset statement,

Affected Releases
Software Component Release From Release To Release And subsequent
SAP_BASIS60620640

Correction delivered in Support Package
Support Packages Release Package Name
SAP_BASIS620SAPKB62047
SAP_BASIS620SAPKB62048
SAP_BASIS620SAPKB62050
SAP_BASIS640SAPKB64010
SAP_BASIS640SAPKB64011
SAP_BASIS640SAPKB64012

Related Notes
1375438Globalization Collection Note
1319517Unicode Collection Note
1038151Legacy files in LSMW: BOM for UTF-8 not possible
910857Incorrect characters (#) in batch input session
863392RFBIBL00: The first record is not a session record
855495RFBIDE00 / RFBIKR00: First record is not a session record
752835Usage of the file interfaces in Unicode systems
747615Tool for converting files from one code page to another
27Recommendations for the ABAP file interface