## Output Data File Details The following table describes the general details for the output files generated by Forager: | **Feature** | **Supported** | **Notes** | | --- | --- | --- | | **Location of files** | **Files in Amazon S3** | Files can be unloaded directly to any user-supplied bucket in S3, then can be downloaded locally using AWS utilities. | | | **Files in Google Cloud Storage** | Files can be unloaded directly to any user-supplied container in Cloud Storage, then can be downloaded locally using Cloud Storage utilities. | | | **Files in Microsoft Azure** | Files can be unloaded directly to any user-supplied container in Azure, then can be downloaded locally using Azure utilities. | | **File formats** | **Delimited files (CSV, TSV, etc.)** | Any valid delimiter is supported; default is comma (i.e. CSV). | | | **JSON** | | | | **Parquet** | | | **File encoding** | **UTF-8** | Output files are always encoded using UTF-8, regardless of the file format; no other character sets are supported. | ### Example file Person NDJSON file: [data_0_0_0.ndjson](/assets/data_0_0_0.d57db7821548feaf629855f6ded1ec65b190fa448a57a88ec27265222c24889c.cdba73be.ndjson) Organization NDJSON file: [data_0_0_0.ndjson](/assets/data_0_0_0.e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855.46e9c3c3.ndjson) ## File paths & names File paths are constructed as so: ``` /forager_data_feed///_/data_.. # Example: /forager_data_feed/123/2025-10-15/person_1.0.0/data_0_0_1.json.gzip ``` Where the above variables represent: - `data_feed_id`: A Forager provided data feed ID, provide by our support team. - `data_feed_type`: Possible options are; `person`, `organization`, or `job`. - `schema_version`: Forager schema version that represents the versioning of serialized data for data feed type, this will be provided by your customer support rep. - `date_created`: Date when the export was created, Ex: `2025-10-16`. - `partition_id`: Partition column values generated when exporting your files, Ex: `0_0_0`, `0_0_1`, etc. - `file_type`: This will be one of the following options; `json`, `csv`, `tsv`, `parquet`.' - `compression_type`: This will be one of the following options; `gzip`, `bz2`, `brotli`, `zstd`. ## Serialized Data Schema [Schema located here.](/data-license/v2/schema)