Skip to content

Releases: Eventual-Inc/Daft

v0.2.22

30 Apr 06:48
c439cbc
Compare
Choose a tag to compare

Changes

This is the last release that will support Python 3.7 which has been EOL for about a year now.

✨ New Features

🚀 Performance Improvements

  • [PERF] Move with_column and exclude function logic to Rust side, add with_columns @kevinzwang (#2167)
  • [PERF] Refactor TreeNode to be native to Arc<TreeNode> @samster25 (#2175)

👾 Bug Fixes

📖 Documentation

🧰 Maintenance

⬆️ Dependencies

5 changes

v0.2.21

17 Apr 00:36
3c9f3ce
Compare
Choose a tag to compare

Changes

✨ New Features

👾 Bug Fixes

  • [BUG] Fix reading partition key columns in DeltaLake @jaychia (#2118)

📖 Documentation

🧰 Maintenance

⬆️ Dependencies

5 changes

v0.2.20

03 Apr 18:30
8023c2e
Compare
Choose a tag to compare

Changes

✨ New Features

🚀 Performance Improvements

  • [PERF] [Delta Lake] Add IO multithreading arg to daft.read_delta_lake(). @clarkzinzow (#2029)

👾 Bug Fixes

📖 Documentation

🧰 Maintenance

⬆️ Dependencies

10 changes

v0.2.19

20 Mar 07:37
c2db062
Compare
Choose a tag to compare

Changes

✨ New Features

👾 Bug Fixes

📖 Documentation

🧰 Maintenance

v0.2.18

13 Mar 18:53
36b47e3
Compare
Choose a tag to compare

Changes

✨ New Features

  • [FEAT] Top level global expressions @kevinzwang (#2000)
  • [FEAT] Add str.capitalize() function @murex971 (#2003)
  • [FEAT] Support reading Parquet files with Field ID @jaychia (#1990)
  • [FEAT] Enable JQ style JSON accessors on strings @colin-ho (#2001)
  • [FEAT] [Catalogs] [Delta Lake] Add support for AWS Glue Catalog and Databricks Unity Catalog integrations to Delta Lake reader @clarkzinzow (#1991)
  • [FEAT] Enable UDF to handle arbitrary number of Daft series @gmweaver (#1984)

👾 Bug Fixes

📖 Documentation

🧰 Maintenance

  • [CHORE] [Hotfix] Remove pyarrow upper bound for Windows. @clarkzinzow (#2002)
  • [CHORE] [Catalogs] [Delta Lake] Add test coverage for Delta Lake reads on Azure. @clarkzinzow (#1970)
  • [CHORE] [Repartitioning] Refactor + hide PartitionSpec and rename to ClusteringSpec. @clarkzinzow (#1961)
  • [CHORE] Simplify cast to schema @jaychia (#1982)
  • [CHORE] Disables anonymous mode for S3 accesses in DeltaLake @jaychia (#1975)
  • [CHORE] Set DAFT_ANALYTICS_ENABLED=0 in nightly tests @jaychia (#1972)

v0.2.17

04 Mar 22:21
59046d3
Compare
Choose a tag to compare

Changes

✨ New Features

🚀 Performance Improvements

📖 Documentation

  • [DOCS] [Delta Lake] Add user guide for Delta Lake reads. @clarkzinzow (#1969)
  • [Catalogs] [Delta Lake] Add initial support for reading from Delta Lake. @clarkzinzow (#1879)
  • [DOCS] Fix notebooks by falling back on null for URL downloads @jaychia (#1951)
  • [DOCS] Add documentation for using and developing Daft on Ray @kevinzwang (#1896)
  • [DOCS] Update schema hints documentation @jaychia (#1935)

🧰 Maintenance

v0.2.16

20 Feb 04:18
1a0e740
Compare
Choose a tag to compare

Changes

✨ New Features

  • [FEAT] perform head operation instead of list when given a file without regex or / @samster25 (#1891)

🚀 Performance Improvements

v0.2.15

15 Feb 18:41
61dd574
Compare
Choose a tag to compare

Changes

👾 Bug Fixes

  • [BUG] dont create dirs if non local fs @samster25 (#1888)
  • [BUG] Fix Ray autoscaling from zero worker CPUs @kevinzwang (#1884)
  • [BUG] Attempt to skip IMDS if region or credentials are provided @samster25 (#1886)
  • [BUG] [Query Planner] Properly track ascending/descending sort order for range partitioning and sorting. @clarkzinzow (#1862)
  • [BUG] Fix bug with merge tasks that allows for tasks larger than max size allowed @samster25 (#1882)

📖 Documentation

🧰 Maintenance

v0.2.14

13 Feb 03:50
195174c
Compare
Choose a tag to compare

Changes

✨ New Features

🚀 Performance Improvements

  • [PERF] Set min_partitions for post aggregation shuffles @colin-ho (#1861)

👾 Bug Fixes

📖 Documentation

🧰 Maintenance

  • [CHORE] Update segment logging to use restricted set of IDs @jaychia (#1870)
  • [CHORE] Fix monotonically_increasing_id physical op @colin-ho (#1853)

v0.2.13

08 Feb 02:40
1fae7ad
Compare
Choose a tag to compare

Changes

✨ New Features

🚀 Performance Improvements

  • [PERF] Split parquet scan tasks into individual row groups @kevinzwang (#1799)

👾 Bug Fixes

📖 Documentation

  • [DOCS] Add docs for Azure IO @jaychia (#1851)
  • [Query Planner] Add physical plan visualization option to df.explain(); implement TreeVisitor for LogicalPlan and PhysicalPlan. @clarkzinzow (#1836)
  • [DOCS] Add type conversions between iceberg and daft @jaychia (#1835)
  • [DOCS] Add dedicated Iceberg page @jaychia (#1830)
  • [DOCS] Refactor expressions docs layout @jaychia (#1816)
  • [CHORE] Add is_in to docs @colin-ho (#1819)

🧰 Maintenance