4027 Commits

Author SHA1 Message Date
anuunchin
033312d373 Fix: The child table column remains in the schema as a partial column with seen-null-first=True (#3131)
* child table column removed from parent

* A utility functin that checks whether a column has seen-null-first set

* Improved comments and docstrings, separate method in worker

* null column not inferred if exists as compound

* Column level x-normalizer cleaning moved outside of worker

* Test for empty column becoming compound

* Test clean_seen_null_first_hint
2025-11-23 18:42:33 +01:00
rudolfix
5242790b13 (fix) use sparse checkout for dlt init dlthub (#3356)
* adds option to sparse checkout repo

* use sparse checkout for llm context

* fixes sqlglot from find

* adds checkout after sparse clone

* explains unknown path tests
2025-11-22 12:15:40 +01:00
djudjuu
bbc1cb81cd fix: dashboard no longer crashes on broken home cell (#3348)
* split home and workspace render methods

* header row dry-er

* catch-all errors in home()-cell

* local try-catch for broken traces

* e2e test for broken trace

* removes this

* shows navigation on pipeline attach error

---------

Co-authored-by: Marcin Rudolf <rudolfix@rudolfix.org>
2025-11-21 20:56:28 +01:00
rudolfix
c943d1c898 (docs) adds community destinations (#3326)
* adds community destinations

* Apply suggestions from code review

applies crate fixes

Co-authored-by: Andreas Motl <andreas.motl@elmyra.de>

---------

Co-authored-by: Andreas Motl <andreas.motl@elmyra.de>
2025-11-21 20:12:05 +01:00
rudolfix
c29264114f fix: backwards compatible traces (#3354)
* makes trace backward compat with 1.17.0 and earlier

* skips trace if any error in unpickle

* always saves merged pipeline trace to have consistent pipeline.last_trace property

* tests for past traces, broken traces and other improvements
2025-11-21 09:09:05 -05:00
rudolfix
8bd0b116fb fixes athena refresh mode (#3313)
* adds filter to exclude dropped tables in staging destination, implements for athena

* enables refresh mode tests for athena, fixes tests

* fixes staging_allowed_local_path on databricks, bumps databricks connector in lockfile

* passes dropped tables schemas to filter, adjust athena filter

* allows to disable lake formation
2025-11-21 10:58:54 +01:00
rudolfix
3bd5099951 fixes sqlglot from find (#3357) 2025-11-20 22:42:36 +01:00
Tim Hable
8a23408366 Redshift feature: Include STS session token in COPY CREDENTIALS. (#3307)
* Redshift feature: Include STS session token in COPY CREDENTIALS. If aws_session_token is present, append the session token. Keeps IAM_ROLE path and long-lieved keys unchanged

---------

Co-authored-by: Tim Hable <thable@varengold.de>
2025-11-20 22:42:16 +01:00
Violetta Mishechkina
b08f2334a8 docs: update weaviate destination docs and version (#3352) 2025-11-20 15:45:00 -05:00
Anton Burnashev
fa06885fe2 Fix DocSearch v4 styles (#3338)
* Fix DocSearch v4 styles
* Fix search input styles for light and dark modes
2025-11-20 15:21:02 +01:00
djudjuu
7702f5c17f override local marimo theme for dashboard (#3337)
This ensures custom CSS is always readable.
2025-11-19 16:18:21 -05:00
David Scharf
4a5ffd82b3 Chore: Update docs npm dependencies and clean up docs build tooling (#3247)
* bump npm deps

* remove unneeded netlify redirects file

* remove unneeded lockfile

* remove another unneeded lockfile

* post rebase lockfile update

* remove old netlify command

* create new docs tools project and move api docs gen there

* tmp

* add uv to build docs workflow

* move docs pyproject

* re-org docs pcakage and move snippet linter

* move notebook linting commands and deps to tools folder
add flake8 to tools linting

* remove unneeded files

* fix linting and formatting errors

* remove wrong file

* move docs processing script to new package

* fix gen api ref

* clean up package json and use commands from parent makefile

* update build website workflow

* move linting to docs makefile partially

* fix python version for docs project

* consolidate docs commands in docs makefile

* fix docs linter

* fully update docs test flow

* fixes some linting and dependency problems

* fix constants

* move notebook formatting to docs project

* fix lint embedded snippets

* fix examples tests

* add missing dependencies

* fix snippet linting

* add missing lint dependencies to core and missing test dependencies to docs

* add missing weaviate

* add missing regex module

* add forked dependency and updates readme file

* revert accidental change to example

* fix main linter

* * Move relevant pytest options to subproject
* Remove shims / path inserts that are now managed by pytest options
* Some typing fixes
* Clean up base project pytest ini
* Enable transformation snippets tests

* remove unneeded raw import of intro snippets

* downgrade alive progress

* uses dlt logger which also fixes internal alive error

* enables transformation snippets linting

* fixes dashboard races again

---------

Co-authored-by: Marcin Rudolf <rudolfix@rudolfix.org>
2025-11-16 18:01:30 +01:00
Taha Muzammil
7b6f8c4ebd fix: minor typos and redundant variable (#3314) 2025-11-14 21:52:56 +01:00
anuunchin
e7e54b2cdf Feat: last pipeline run section in dashboard (#3250)
* Initial commit

* Html cleaned

* Summary moved to home section, migration badge added

* Load package status badges improved

* Test getting steps data, migrations count

* Various tests

* Fix in test

* Styles moved, improved ui

---------

Co-authored-by: Marcin Rudolf <rudolfix@rudolfix.org>
2025-11-14 19:58:06 +01:00
ivasio
dc1a0467f8 Feat: support return_type = arrow_stream for connectorx backend (#3218)
* make arrow_stream default return_type for connectorx backend

* formatting

* bump connectorx version

* return to arrow by default, keep arrow_stream support, add info message

* document arrow_stream cornercases in the docs

* add the test for connectorx arrow_stream return type

* fix formatting

* fix test typo

* fix the tests

* fix package version check, return original version constraint

* adds utils function to losless cast date64 to timestamp[us]

* cast date64 to timestamp for connectorx, update test

---------

Co-authored-by: ivasio <ivan@dlthub.com>
Co-authored-by: Marcin Rudolf <rudolfix@rudolfix.org>
2025-11-14 19:57:06 +01:00
Brandon Freeman
cf4094c574 Make with_table_name and other functions available through dlt.pipeline.mark again. (#3318) 2025-11-14 19:56:37 +01:00
Menna
4d25a6c5b5 feat/3198-add-workspace-info-and-profile-selection
Added a dropdown for profile selection in the dashboard interface and updated the layout to display profile and workspace information inline with pipeline selection.
2025-11-14 18:44:45 +01:00
Martin Bach
4224e88c29 Docs: fix footer in darkmode, add scaffoldigns link (#3309) 2025-11-14 13:15:49 +01:00
dat-a-man
7dfa61fc60 updated the sql databases configuration docs (#3107)
* updated the sql databases configuration docs

* Updated sql database and table sources as well which is nice

* updated

* Updated

* Updated docstrings for defer_table_reflect parameter in SQL Database source.

* Updated
2025-11-14 11:55:16 +01:00
David Scharf
619402857b enable ci runs for PRs against the runtime branch (#3317) 2025-11-14 11:54:02 +01:00
Alena Astrakhantseva
f18f6b8d4a Update deploy-with-dagster.md (#3287) 2025-11-13 16:36:33 +01:00
dat-a-man
4fa832ee9e Add example to SQL docs: updated docs on how to filter rows using query_adapter_callback (#3253)
* Updated docs final

* Updating the section, making it LLM friendly

* Minor liniting errors
2025-11-13 16:12:59 +01:00
Thierry Jean
152555a54e feat: updated scaffolding template (#3275)
* updated template

* renamed pipeline; set dev_mode=True

* set refresh to drop everything

* lint fixes

* makes dataset_name optional in init scripts

* falls to object __repr__ if Pipeline repr fails

* allows to instantiate pipeline on pipeline script import

* remove dev_mode kwarg

* fix unrelated docs notebook

---------

Co-authored-by: Marcin Rudolf <rudolfix@rudolfix.org>
2025-11-12 20:41:55 -05:00
ivasio
0243f95781 fix formatting (#3305)
Co-authored-by: ivasio <ivan@dlthub.com>
2025-11-12 16:56:44 -05:00
Menna
8a16442293 fix/3165: Athena LakeFormation permissions are required even tho Lakeformation is not used
Fixed the bug that enforces the need for Lakeformation permissions when Lakeformation is not being used.
2025-11-12 14:09:36 +01:00
molkazhani2001
229f05f42f add init_replication description and required permissions (#3020)
* add init_replication description and required permissions

* add unit 3

* pl_replciation cleaned up

* pg replication docs improved

* pg_replicaiton.md made clearer

---------

Co-authored-by: anuunchin <88698977+anuunchin@users.noreply.github.com>
2025-11-10 16:14:54 +01:00
Aashish Nair
b71feca1b0 Marimo docs page: added quotations to pip install ibis-framework[duckdb] command (#3304) 2025-11-10 16:07:46 +01:00
Jay Jaisankar
83245608b0 Basics course - Reinitialize packages after exit() is called (#3300)
* Add package after exit()

* Add TDataItems module
2025-11-10 15:39:09 +01:00
김승훈
8b645faa7a docs: update incorrect LLM-native workflow link (404 error) (#3294) 2025-11-06 15:43:18 +01:00
rudolfix
928310aefb docs - improves hub docs (#3282)
* adds tools to generate api reference for workspace

* writes install, mcp, api reference and improves other docs in hub

* Apply suggestions from code review

Co-authored-by: Violetta Mishechkina <sansiositres@gmail.com>

* fixes free tier

---------

Co-authored-by: Violetta Mishechkina <sansiositres@gmail.com>
2025-11-04 15:14:34 +01:00
Violetta Mishechkina
eb2d3a21fe Minor hub docs polishing (#3284)
* Minor hub docs polishing

* fixes workflow setup wrt not running certain steps if there are only docs changes

* Remove the duplicate content

* Fix build

---------

Co-authored-by: David Scharf <shrps@posteo.net>
2025-11-04 12:39:40 +01:00
Marcin Rudolf
99f33ad614 Merge branch 'master' into devel 2025-11-03 13:57:19 +01:00
Marcin Rudolf
9a2b7a7db0 fixes typo in docs intro 2025-11-03 13:31:40 +01:00
rudolfix
6811dd7044 clones command repos in global_dir & bumps to 1.18.2 (#3279)
* clones command repos into global_dir, not data_dir

* bumps to version 1.18.2
2025-11-03 13:22:18 +01:00
rudolfix
d671376e68 fixes default limit in ibis backend (#3273) 2025-11-01 16:34:14 -04:00
rudolfix
39d870c7f3 refresh docs intro (master) (#3274)
* renames pipeline to workspace dashboard

* refreshes intro

* review changes

* sidebar, references, dataset.table( cleanup
2025-10-31 17:32:18 +01:00
rudolfix
4a431d60ed refresh docs intro (#3270)
* renames pipeline to workspace dashboard

* refreshes intro

* review changes

* sidebar, references, dataset.table( cleanup
2025-10-31 17:14:49 +01:00
rudolfix
fa02892c9c Merge pull request #3267 from dlt-hub/devel
master merge for 1.18.1 release
1.18.1
2025-10-29 23:50:46 +01:00
rudolfix
192296f4f8 fixes git import and enables tests (#3262)
* enable hub tests

* removes erroneous git import

* enables tests with importing dlt into minimal alpine container

* imports workspace modules on demand

* bumps dlt to version 1.18.1

* fixes mssql hub test on mac

* review fixes
2025-10-29 21:32:07 +01:00
David Scharf
cbf9db47c4 Fix docs deployment (#3266)
* install watchdog in install command

* remove uneeded file

* amend update versions to run outside of uv
2025-10-29 15:14:29 +01:00
Thierry Jean
0bdf8dc424 feat: add dlt.hub.data_quality entrypoint (#3259) 2025-10-29 08:12:49 -04:00
rudolfix
c050556cc0 fixes installation and intro pages in hub (#3257) 2025-10-28 18:50:36 +01:00
rudolfix
0da0091599 Merge pull request #3258 from dlt-hub/devel
master merge for 1.18.0 release
1.18.0
2025-10-28 15:45:14 +01:00
Marcin Rudolf
df8ccecbb8 fixes flaky signal tests in pipelines 2025-10-28 14:03:36 +01:00
rudolfix
e56f617c0e adds more signal options (#3248)
* adds option in load that prevents draining pool on signal

* adds runtime pipeline option to not intercept signals

* refactors signal module

* tests new cases

* describes signal handling in running in prod docs

* bumps dlt to 1.18.0

* fixes tests forked

* removes logging and buffered console output from signals

* adds retry count to load job metrics, generates started_at in init of runnable load job

* allows to update existing metrics in load step

* finalized jobs require start and finish dates

* generates metrics in each job state and in each completed loop, does not complete package if pool drained but jobs left, adds detailed tests for metrics

* fixes remote metrics

* replaces event with package bound semaphore to complete load jobs early

* fixes dashboard to on windows

* improves signals docs

* renames delayed_signals to intercepted_signals
2025-10-28 13:56:24 +01:00
anuunchin
449d914d7a Fix: Empty columns that were previously flattened into compound ones violate freeze contract (#3226)
* Initial commit

* adds commented out test case that leaves columns with None

---------

Co-authored-by: Marcin Rudolf <rudolfix@rudolfix.org>
2025-10-27 23:20:05 +01:00
Thierry Jean
718b636045 fix: .to_ibis() query normalization + docs update (#3225)
* use dlt.Dataset query normalization in _DltBackend

* pass dlt SQL cursor to _DltBackend instead of return values

---------

Co-authored-by: Marcin Rudolf <rudolfix@rudolfix.org>
2025-10-27 13:50:04 -04:00
Violetta Mishechkina
38b0dec5a1 Add dlthub intro docs (#3241)
* Add dlthub intro

* Update with comments
2025-10-27 16:23:37 +01:00
anuunchin
335aea8eaf Fix: workspace package manifest has sorted files (#3251) 2025-10-27 16:20:06 +01:00
Alena Astrakhantseva
73e861f850 add profiles (#3252) 2025-10-25 08:14:04 +02:00