Input arguments to the read_json interface.
More...
#include <json.hpp>
Public Types | |
| using | dtype_variant = std::variant< std::vector< data_type >, std::map< std::string, data_type >, std::map< std::string, schema_element >, schema_element > |
| Variant type holding dtypes information for the columns. | |
Public Member Functions | |
| json_reader_options ()=default | |
| Default constructor. More... | |
| source_info const & | get_source () const |
| Returns source info. More... | |
| dtype_variant const & | get_dtypes () const |
| Returns data types of the columns. More... | |
| compression_type | get_compression () const |
| Returns compression format of the source. More... | |
| size_t | get_byte_range_offset () const |
| Returns number of bytes to skip from source start. More... | |
| size_t | get_byte_range_size () const |
| Returns number of bytes to read. More... | |
| size_t | get_byte_range_size_with_padding () const |
| Returns number of bytes to read with padding. More... | |
| size_t | get_byte_range_padding () const |
| Returns number of bytes to pad when reading. More... | |
| char | get_delimiter () const |
| Returns delimiter separating records in JSON lines. More... | |
| bool | is_enabled_lines () const |
| Whether to read the file as a json object per line. More... | |
| bool | is_enabled_mixed_types_as_string () const |
| Whether to parse mixed types as a string column. More... | |
| bool | is_enabled_prune_columns () const |
| Whether to prune columns on read, selected based on the set_dtypes option. More... | |
| bool | is_enabled_experimental () const |
| Whether to enable experimental features. More... | |
| bool | is_enabled_dayfirst () const |
| Whether to parse dates as DD/MM versus MM/DD. More... | |
| bool | is_enabled_keep_quotes () const |
| Whether the reader should keep quotes of string values. More... | |
| bool | is_enabled_normalize_single_quotes () const |
| Whether the reader should normalize single quotes around strings. More... | |
| bool | is_enabled_normalize_whitespace () const |
| Whether the reader should normalize unquoted whitespace characters. More... | |
| json_recovery_mode_t | recovery_mode () const |
| Queries the JSON reader's behavior on invalid JSON lines. More... | |
| bool | is_strict_validation () const |
| Whether json validation should be enforced strictly or not. More... | |
| bool | is_allowed_numeric_leading_zeros () const |
| Whether leading zeros are allowed in numeric values. More... | |
| bool | is_allowed_nonnumeric_numbers () const |
| Whether unquoted number values should be allowed NaN, +INF, -INF, +Infinity, Infinity, and -Infinity. More... | |
| bool | is_allowed_unquoted_control_chars () const |
| Whether in a quoted string should characters greater than or equal to 0 and less than 32 be allowed without some form of escaping. More... | |
| std::vector< std::string > const & | get_na_values () const |
| Returns additional values to recognize as null values. More... | |
| void | set_source (source_info src) |
| Sets source info. More... | |
| void | set_dtypes (std::vector< data_type > types) |
| Set data types for columns to be read. More... | |
| void | set_dtypes (std::map< std::string, data_type > types) |
| Set data types for columns to be read. More... | |
| void | set_dtypes (std::map< std::string, schema_element > types) |
| Set data types for a potentially nested column hierarchy. More... | |
| void | set_dtypes (schema_element types) |
| Set data types for a potentially nested column hierarchy. More... | |
| void | set_compression (compression_type comp_type) |
| Set the compression type. More... | |
| void | set_byte_range_offset (size_t offset) |
| Set number of bytes to skip from source start. More... | |
| void | set_byte_range_size (size_t size) |
| Set number of bytes to read. More... | |
| void | set_delimiter (char delimiter) |
| Set delimiter separating records in JSON lines. More... | |
| void | enable_lines (bool val) |
| Set whether to read the file as a json object per line. More... | |
| void | enable_mixed_types_as_string (bool val) |
| Set whether to parse mixed types as a string column. Also enables forcing to read a struct as string column using schema. More... | |
| void | enable_prune_columns (bool val) |
| Set whether to prune columns on read, selected based on the set_dtypes option. More... | |
| void | enable_experimental (bool val) |
| Set whether to enable experimental features. More... | |
| void | enable_dayfirst (bool val) |
| Set whether to parse dates as DD/MM versus MM/DD. More... | |
| void | enable_keep_quotes (bool val) |
| Set whether the reader should keep quotes of string values. More... | |
| void | enable_normalize_single_quotes (bool val) |
| Set whether the reader should enable normalization of single quotes around strings. More... | |
| void | enable_normalize_whitespace (bool val) |
| Set whether the reader should enable normalization of unquoted whitespace. More... | |
| void | set_recovery_mode (json_recovery_mode_t val) |
| Specifies the JSON reader's behavior on invalid JSON lines. More... | |
| void | set_strict_validation (bool val) |
| Set whether strict validation is enabled or not. More... | |
| void | allow_numeric_leading_zeros (bool val) |
| Set whether leading zeros are allowed in numeric values. Strict validation must be enabled for this to work. More... | |
| void | allow_nonnumeric_numbers (bool val) |
| Set whether unquoted number values should be allowed NaN, +INF, -INF, +Infinity, Infinity, and -Infinity. Strict validation must be enabled for this to work. More... | |
| void | allow_unquoted_control_chars (bool val) |
| Set whether in a quoted string should characters greater than or equal to 0 and less than 32 be allowed without some form of escaping. Strict validation must be enabled for this to work. More... | |
| void | set_na_values (std::vector< std::string > vals) |
| Sets additional values to recognize as null values. More... | |
Static Public Member Functions | |
| static json_reader_options_builder | builder (source_info src) |
| create json_reader_options_builder which will build json_reader_options. More... | |
Input arguments to the read_json interface.
Available parameters are closely patterned after PANDAS' read_json API. Not all parameters are supported. If the matching PANDAS' parameter has a default value of None, then a default value of -1 or 0 may be used as the equivalent.
Parameters in PANDAS that are unavailable or in cudf:
| Name | Description |
|---|---|
orient | currently fixed-format |
typ | data is always returned as a cudf::table |
convert_axes | use column functions for axes operations instead |
convert_dates | dates are detected automatically |
keep_default_dates | dates are detected automatically |
numpy | data is always returned as a cudf::table |
precise_float | there is only one converter |
date_unit | only millisecond units are supported |
encoding | only ASCII-encoded data is supported |
chunksize | use byte_range_xxx for chunking instead |
Definition at line 85 of file io/json.hpp.
|
default |
Default constructor.
This has been added since Cython requires a default constructor to create objects on stack.
|
inline |
Set whether unquoted number values should be allowed NaN, +INF, -INF, +Infinity, Infinity, and -Infinity. Strict validation must be enabled for this to work.
| cudf::logic_error | if strict_validation is not enabled before setting this option. |
| val | Boolean value to indicate whether leading zeros are allowed in numeric values |
Definition at line 555 of file io/json.hpp.
|
inline |
Set whether leading zeros are allowed in numeric values. Strict validation must be enabled for this to work.
| cudf::logic_error | if strict_validation is not enabled before setting this option. |
| val | Boolean value to indicate whether leading zeros are allowed in numeric values |
Definition at line 541 of file io/json.hpp.
|
inline |
Set whether in a quoted string should characters greater than or equal to 0 and less than 32 be allowed without some form of escaping. Strict validation must be enabled for this to work.
| cudf::logic_error | if strict_validation is not enabled before setting this option. |
| val | true to indicate whether unquoted control chars are allowed. |
Definition at line 570 of file io/json.hpp.
|
static |
create json_reader_options_builder which will build json_reader_options.
| src | source information used to read json file |
|
inline |
Set whether to parse dates as DD/MM versus MM/DD.
| val | Boolean value to enable/disable day first parsing format |
Definition at line 493 of file io/json.hpp.
|
inline |
Set whether to enable experimental features.
When set to true, experimental features, such as the new column tree construction, utf-8 matching of field names will be enabled.
| val | Boolean value to enable/disable experimental features |
Definition at line 486 of file io/json.hpp.
|
inline |
Set whether the reader should keep quotes of string values.
| val | Boolean value to indicate whether the reader should keep quotes of string values |
Definition at line 501 of file io/json.hpp.
|
inline |
Set whether to read the file as a json object per line.
| val | Boolean value to enable/disable the option to read each line as a json object |
Definition at line 457 of file io/json.hpp.
|
inline |
Set whether to parse mixed types as a string column. Also enables forcing to read a struct as string column using schema.
| val | Boolean value to enable/disable parsing mixed types as a string column |
Definition at line 465 of file io/json.hpp.
|
inline |
Set whether the reader should enable normalization of single quotes around strings.
| val | Boolean value to indicate whether the reader should normalize single quotes around strings |
Definition at line 509 of file io/json.hpp.
|
inline |
Set whether the reader should enable normalization of unquoted whitespace.
| val | Boolean value to indicate whether the reader should normalize unquoted whitespace characters i.e. tabs and spaces |
Definition at line 517 of file io/json.hpp.
|
inline |
Set whether to prune columns on read, selected based on the set_dtypes option.
When set as true, if the reader options include set_dtypes, then the reader will only return those columns which are mentioned in set_dtypes. If false, then all columns are returned, independent of the set_dtypes setting.
| val | Boolean value to enable/disable column pruning |
Definition at line 476 of file io/json.hpp.
|
inline |
Returns number of bytes to skip from source start.
Definition at line 196 of file io/json.hpp.
|
inline |
Returns number of bytes to pad when reading.
Definition at line 224 of file io/json.hpp.
|
inline |
Returns number of bytes to read.
Definition at line 203 of file io/json.hpp.
|
inline |
Returns number of bytes to read with padding.
Definition at line 210 of file io/json.hpp.
|
inline |
Returns compression format of the source.
Definition at line 189 of file io/json.hpp.
|
inline |
Returns delimiter separating records in JSON lines.
Definition at line 250 of file io/json.hpp.
|
inline |
Returns data types of the columns.
Definition at line 182 of file io/json.hpp.
|
inline |
Returns additional values to recognize as null values.
Definition at line 369 of file io/json.hpp.
|
inline |
|
inline |
Whether unquoted number values should be allowed NaN, +INF, -INF, +Infinity, Infinity, and -Infinity.
Definition at line 349 of file io/json.hpp.
|
inline |
Whether leading zeros are allowed in numeric values.
Definition at line 336 of file io/json.hpp.
|
inline |
Whether in a quoted string should characters greater than or equal to 0 and less than 32 be allowed without some form of escaping.
Definition at line 359 of file io/json.hpp.
|
inline |
Whether to parse dates as DD/MM versus MM/DD.
Definition at line 292 of file io/json.hpp.
|
inline |
Whether to enable experimental features.
When set to true, experimental features, such as the new column tree construction, utf-8 matching of field names will be enabled.
Definition at line 285 of file io/json.hpp.
|
inline |
Whether the reader should keep quotes of string values.
Definition at line 299 of file io/json.hpp.
|
inline |
Whether to read the file as a json object per line.
true if reading the file as a json object per line Definition at line 257 of file io/json.hpp.
|
inline |
Whether to parse mixed types as a string column.
true if mixed types are parsed as a string column Definition at line 264 of file io/json.hpp.
|
inline |
Whether the reader should normalize single quotes around strings.
Definition at line 306 of file io/json.hpp.
|
inline |
Whether the reader should normalize unquoted whitespace characters.
Definition at line 313 of file io/json.hpp.
|
inline |
Whether to prune columns on read, selected based on the set_dtypes option.
When set as true, if the reader options include set_dtypes, then the reader will only return those columns which are mentioned in set_dtypes. If false, then all columns are returned, independent of the set_dtypes setting.
Definition at line 276 of file io/json.hpp.
|
inline |
Whether json validation should be enforced strictly or not.
Definition at line 327 of file io/json.hpp.
|
inline |
Queries the JSON reader's behavior on invalid JSON lines.
Definition at line 320 of file io/json.hpp.
|
inline |
Set number of bytes to skip from source start.
| offset | Number of bytes of offset |
Definition at line 419 of file io/json.hpp.
|
inline |
Set number of bytes to read.
| size | Number of bytes to read |
Definition at line 426 of file io/json.hpp.
|
inline |
Set the compression type.
| comp_type | The compression type used |
Definition at line 412 of file io/json.hpp.
|
inline |
Set delimiter separating records in JSON lines.
| delimiter | Delimiter separating records in JSON lines |
Definition at line 433 of file io/json.hpp.
| void cudf::io::json_reader_options::set_dtypes | ( | schema_element | types | ) |
Set data types for a potentially nested column hierarchy.
| types | schema element with column names and column order to support arbitrary nesting of data types |
|
inline |
Set data types for columns to be read.
| types | Vector dtypes in string format |
Definition at line 390 of file io/json.hpp.
|
inline |
Set data types for a potentially nested column hierarchy.
| types | Map of column names to schema_element to support arbitrary nesting of data types |
Definition at line 397 of file io/json.hpp.
|
inline |
Set data types for columns to be read.
| types | Vector of dtypes |
Definition at line 383 of file io/json.hpp.
|
inline |
Sets additional values to recognize as null values.
| vals | Vector of values to be considered to be null |
Definition at line 581 of file io/json.hpp.
|
inline |
Specifies the JSON reader's behavior on invalid JSON lines.
| val | An enum value to indicate the JSON reader's behavior on invalid JSON lines. |
Definition at line 524 of file io/json.hpp.
|
inline |
|
inline |
Set whether strict validation is enabled or not.
| val | Boolean value to indicate whether strict validation is enabled. |
Definition at line 531 of file io/json.hpp.