-
Notifications
You must be signed in to change notification settings - Fork 35
Import CSV with NULL values #401
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi @danielnaab, nullable support is under active development for the next TileDB core release. Right now, https://github.com/TileDB-Inc/TileDB-Py/blob/dev/tiledb/tests/test_pandas_dataframe.py#L506-L509 For your dataset, I tested the following call to successfully write/read a TileDB array from the CSV file:
Compare to reading directly with pandas:
|
Hi @ihnorton thanks for the clarification and example! The use case I'm evaluating TileDB for involves lots of unpredictable data, so that would be a very welcome feature addition. By next core release, do you mean the next minor or major release? ie, TileDB 2.2 or 3.x? |
TileDB 2.2, which we are aiming to release by mid-November (@joe-maley is pretty far along with the project and I believe we are going to start testing the branch with Java and MariaDB this week, followed by Python). Would you be up for a call to discuss? We really like to have several driving external users for new features, to make sure we satisfy your use-case (as well as provide high-bandwidth fixes and updates if needed when it is first released). If so, please let us know at |
It is now possible to use Example (using TileDB-Py 0.12.1):
(please ping if needed and happen to re-open or open another issue) |
I'm having a problem processing CSVs that contains NULL values. For example:
Produces the following exception:
Is this a naive usage of
tiledb.from_csv
(ie, I'm doing it wrong), or should I expect it to handle empty values like in this sample CSV?The text was updated successfully, but these errors were encountered: