---json { "page_id": "zoevghaij9tk9fs9hmj0c" } --- ====== Data Resource - Text File ====== ===== About ===== A ''text'' file is a [[file|file]] [[resource|data resource]] that contains ''characters'' data. ===== Transfer ===== In a [[:docs:op:transfer|transfer]]: * as a [[:docs:flow:input|input]], it has only one column * as a [[docs:flow:target|target]], it's considered a free form structure that accepts any number of columns. ===== Attributes ===== A ''text file'' allows to specifies the following [[attribute|attributes]]. ^ Name ^ Default Values ^ Description ^ | ''end-of-record'' | ''\r\n'', ''\n'' or ''\r'' on Read \\ ''\r\n'' (Windows) or ''\n'' (Linux) on Write | The ''end of record (EOR)'' are the characters that design the end of a [[record|record]] (by default the end of line) | | ''character-set'' | Detected on Read \\ ''UTF-8'' on Creation | The character set that maps bits data to characters. | | ''column-name'' | ''Lines'' | The default name of the column when there is no [[:docs:resource:structure|structure]] defined. \\ A text file is then read as [[tabular|tabular data]] with one column of [[docs:data_type:data_type|type]] text. | Note also that the [[count|count common attribute]] gives you the number of record (line by default). ===== CharacterSet ===== The below values are the detected and most known ''characterSet'' values. If a value entered is not supported, you will get the whole list of supported character set. ^ Value ^ Languages (Description) ^ | ''US-ASCII'' | Seven-bit ASCII, a.k.a. | | ''UTF-16'' | Sixteen-bit UCS Transformation Format, \\ byte order identified by an optional byte-order mark | | ''UTF-16BE'' | Sixteen-bit UCS Transformation Format, big-endian byte order | | ''UTF-16LE'' | Sixteen-bit UCS Transformation Format, little-endian byte order | | ''UTF-8'' | Eight-bit UCS Transformation Format | | ''UTF-32BE'' | | | ''UTF-32LE'' | | | ''Shift_JIS'' | Japanese | | ''ISO-2022-JP'' | Japanese | | ''ISO-2022-CN'' | Simplified Chinese | | ''ISO-2022-KR'' | Korean | | ''GB18030'' | Chinese | | ''Big5'' | Traditional Chinese | | ''EUC-JP'' | Japanese | | ''EUC-KR'' | Korean | | ''ISO-8859-1'' | Danish, Dutch, English, French, German, Italian, \\ Norwegian, Portuguese, Swedish - ISO Latin Alphabet No | | ''ISO-8859-2'' | Czech, Hungarian, Polish, Romanian | | ''ISO-8859-5'' | Russian | | ''ISO-8859-6'' | Arabic | | ''ISO-8859-7'' | Greek | | ''ISO-8859-8'' | Hebrew | | ''ISO-8859-9'' | Turkish | | ''windows-1250'' | Czech, Hungarian, Polish, Romanian | | ''windows-1251'' | Russian | | ''windows-1252'' | Danish, Dutch, English, French, German, Italian, \\ Norwegian, Portuguese, Swedish | | ''windows-1253'' | Greek | | ''windows-1254'' | Turkish | | ''windows-1255'' | Hebrew | | ''windows-1256'' | Arabic | | ''KOI8-R'' | Russian | | ''IBM420'' | Arabic | | ''IBM424'' | Hebrew |