Skip to content

Expose scan performance tuning options in read_lance#24

Open
beinan wants to merge 2 commits into
daft-engine:mainfrom
beinan:feat/scan-options
Open

Expose scan performance tuning options in read_lance#24
beinan wants to merge 2 commits into
daft-engine:mainfrom
beinan:feat/scan-options

Conversation

@beinan

@beinan beinan commented Jun 6, 2026

Copy link
Copy Markdown
Contributor

Summary

  • Add scan_options parameter to read_lance() that forwards kwargs to lance.LanceDataset.scanner() at scan time
  • Supports batch_size, batch_readahead, fragment_readahead, scan_in_order, late_materialization, prefilter, offset, use_scalar_index, io_buffer_size, and any future scanner params
  • Unknown keys are forwarded as-is for forward compatibility with newer Lance versions

Test plan

  • 4 new tests: scan_options alone, with filter+limit, with fragment grouping, explicit None
  • All 27 existing read tests pass (1 skipped for geo dependency)

🤖 Generated with Claude Code

Beinan Wang added 2 commits June 6, 2026 01:44
Add scan_options parameter to read_lance() that forwards kwargs like
batch_size, batch_readahead, fragment_readahead, scan_in_order, and
late_materialization to lance's scanner for read performance tuning.

Co-Authored-By: Beinan Wang <beinanwang@microsoft.com>
Co-Authored-By: Beinan Wang <beinanwang@microsoft.com>
@beinan beinan marked this pull request as ready for review June 6, 2026 21:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant