dbt-adapters updated the incremental_strategy validation of incremental models such that
the validation now _always_ happens when an incremental model is executed. A test in dbt-core
`TestMicrobatchCustomUserStrategyEnvVarTrueInvalid` was previously set to _expect_ buggy behavior
where an incremental model would succeed on its "first"/"refresh" run even if it had an invalid
incremental strategy. Thus we needed to update this test in dbt-core to expect the now-correct
behavior of validating the incremental strategy at execution time.
* [Tidy-First]: Fix `timings` object for hooks and macros, and make types of timings explicit
* cast literal to str
* change test
* change jsonschema to enum
* Discard changes to schemas/dbt/manifest/v12.json
* nits
---------
Co-authored-by: Chenyu Li <chenyu.li@dbtlabs.com>
* Add `order_by` and `limit` fields to saved queries.
* Update JSON schema
* Add change log for #10531.
* Check order by / limit in saved-query parsing test.
* Add test that checks microbatch models are all operating with the same `current_time`
* Set an `invocated_at` on the `RuntimeConfig` and plumb to `MicrobatchBuilder`
* Add changie doc
* Rename `invocated_at` to `invoked_at`
* Simplify conditional logic for setting MicrobatchBuilder.batch_current_time
* Rename `batch_current_time` to `default_end_time` for MicrobatchBuilder
* Begin testing that microbatch execution times are being tracked and set
* Begin tracking the execution time of batches for microbatch models
* Add changie doc
* Additional assertions in microbatch testing
* Validate that `event_time_start` is before `event_time_end` when passed from CLI
Sometimes CLI options have restrictions based on other CLI options. This is the case
for `--event-time-start` and `--event-time-end`. Unfortunately, click doesn't provide
a good way to validate this, at least none that I found. Additionally, I'm not sure
whether we've had anything like this previously. In any case, I couldn't find a
centralized validation area for such occurrences, so I've added one,
`validate_option_interactions`. Long term, if more validations are added, we should
add this wrapper to each CLI command. For now I've only added it to the commands that
support `event_time_start` and `event_time_end`, specifically `build` and `run` (see the sketch below).
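For illustration, a wrapper of this sort could look roughly like the following. This is a minimal sketch, not the actual dbt-core implementation (the real validation was later moved into `flags.py`), and the command/option wiring here is assumed for the example:

```python
import functools

import click


def validate_option_interactions(func):
    """Reject option combinations that click alone cannot validate."""

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = kwargs.get("event_time_start")
        end = kwargs.get("event_time_end")
        # --event-time-start must come strictly before --event-time-end
        if start is not None and end is not None and start >= end:
            raise click.BadParameter(
                "--event-time-start must be before --event-time-end"
            )
        return func(*args, **kwargs)

    return wrapper


@click.command()
@click.option("--event-time-start", type=click.DateTime())
@click.option("--event-time-end", type=click.DateTime())
@validate_option_interactions
def run(event_time_start, event_time_end):
    click.echo("event-time window validated")


if __name__ == "__main__":
    run()
```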
* Add changie doc
* If `--event-time-end` is not specified, ensure `--event-time-start` is before the current time
* Fixup error message about event_time_start and event_time_end
* Move logic to validate `event_time` cli flags to `flags.py`
* Update validation of `--event-time-start` against `datetime.now` to use UTC
* When retrying microbatch models, propagate prior successful state
* Changie doc for microbatch dbt retry fixes
* Fix test_manifest unit tests for batch_info key
* Add functional test for when a microbatch model has multiple retries
* Add comment about when batch_info will be something other than None
* Add tests to check how microbatch models respect `full_refresh` model configs
* Fix `_is_incremental` to properly respect `full_refresh` model config
In dbt-core, it is generally expected that values passed via CLI flags take
precedence over model-level configs. However, `full_refresh` on a model is an
exception to this rule, wherein the model config takes precedence. This
config exists specifically to _prevent_ accidental full refreshes of large
incremental models, as doing so can be costly. **_It is actually best
practice_** to set `full_refresh=False` on incremental models.
Prior to this commit, the above was not happening for microbatch models. The
CLI flag `--full-refresh` was taking precedence over the model config
`full_refresh`. That meant that if `--full-refresh` was supplied, the
microbatch model **_would full refresh_** even if `full_refresh=False` was
set on the model. This commit solves that problem.
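As a rough illustration of the precedence rule (a hypothetical helper, not the actual `_is_incremental` code):

```python
from typing import Optional


def should_full_refresh(cli_full_refresh: bool, model_full_refresh: Optional[bool]) -> bool:
    """An explicit model-level full_refresh config wins over the --full-refresh
    CLI flag; when the model leaves it unset, the CLI flag decides."""
    if model_full_refresh is not None:
        return model_full_refresh
    return cli_full_refresh


# --full-refresh was passed, but the model pins full_refresh=False, so no full refresh happens
assert should_full_refresh(cli_full_refresh=True, model_full_refresh=False) is False
```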
* Add changie doc for microbatch `full_refresh` config handling
* Add `PartialSuccess` status type and use it for microbatch models with mixed results
* Handle `PartialSuccess` in `interpret_run_result`
* Add `BatchResults` object to `BaseResult` and begin tracking during microbatch runs
* Ensure batch_results being propagated to `run_results` artifact
* Move `batch_results` from `BaseResult` class to `RunResult` class
* Move `BatchResults` and `BatchType` to separate artifacts file to avoid circular imports
In our next commit we're going to modify `dbt/contracts/graph/nodes.py` to import
`BatchType` as part of our work to implement dbt retry for microbatch model nodes.
Unfortunately, that import in `nodes.py` creates a circular dependency because
`dbt/artifacts/schemas/results.py` imports from `nodes.py` and `dbt/artifacts/schemas/run/v5/run.py`
imports from that `results.py`. Thus the new import creates a circular import. This
_shouldn't_ be necessary, as nothing in artifacts should import from the rest of dbt-core.
However, we currently do. We should fix that, but it is out of scope for this segment of work.
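For illustration, a standalone artifacts module along these lines breaks the cycle because it imports nothing from the rest of dbt-core. The module path and exact field shapes below are assumptions for the sketch, not the actual file:

```python
# Illustrative standalone module, e.g. dbt/artifacts/schemas/batch_results.py (assumed path)
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Tuple

# A batch is identified by the event-time window it covers: (start, end).
BatchType = Tuple[datetime, datetime]


@dataclass
class BatchResults:
    successful: List[BatchType] = field(default_factory=list)
    failed: List[BatchType] = field(default_factory=list)
```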
* Add `PartialSuccess` as a retry-able status, and use batches to retry microbatch models
* Fix BatchType type so that the first datetime is no longer Optional
* Ensure `PartialSuccess` causes skipping of downstream nodes
* Alter `PartialSuccess` status to be considered an error in `interpret_run_result`
* Update schemas and test artifacts to include new batch_results run results key
* Add functional test to check that 'dbt retry' retries 'PartialSuccess' models
* Update partition failure test to assert downstream models are skipped
* Improve `success`/`error`/`partial success` messaging for microbatch models
* Include `PartialSuccess` in status that `--fail-fast` counts as a failure
* Update `LogModelResult` to handle partial successes
* Update `EndOfRunSummary` to handle partial successes
* Cleanup TODO comment
* Raise a DbtInternalError if we get a batch run result without `batch_results`
* When running a microbatch model with supplied batches, force non full-refresh behavior
This is necessary because of retry. Say on the initial run the microbatch model
succeeds on 97% of its batches, and on retry it runs the remaining 3%. If the retry
of the microbatch model executes in full-refresh mode, it _might_ blow away the
97% of work that has already been done. This edge case appears to be adapter-specific.
* Only pass batches to retry for microbatch model when there was a PartialSuccess
In the previous commit we made it so that retries of microbatch models wouldn't
run in full refresh mode when the microbatch model to retry has batches already
specified from the prior run. This is only problematic when the run being retried
was a full refresh AND all the batches for a given microbatch model failed. In
that case WE DO want to do a full refresh for the given microbatch model. To better
outline the problem, consider the following:
* a microbatch model had a `begin` of `2020-01-01` and had been running that way for a while
* the `begin` config changed to `2024-01-01` and `dbt run --full-refresh` was run
* every batch for the microbatch model failed
* on `dbt retry`, the relation is found to already exist, and the now out-of-range data (2020-01-01 through 2023-12-31) is never purged
To avoid this, all we have to do is ONLY pass the batch information for partially successful microbatch
models. Note: microbatch models only have a partially successful status IFF they have both
successful and failed batches.
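A simplified sketch of the resulting rule (a hypothetical helper, not the actual retry code):

```python
from datetime import datetime
from typing import List, Optional, Tuple

BatchType = Tuple[datetime, datetime]


def batches_to_retry(
    succeeded: List[BatchType], failed: List[BatchType]
) -> Optional[List[BatchType]]:
    """Only a partially successful microbatch model (some batches succeeded AND
    some failed) hands its failed batches to `dbt retry`. Returning None means
    the retry plans its batches from scratch, so a run whose batches all failed
    under --full-refresh can still be fully refreshed on retry."""
    if succeeded and failed:
        return failed
    return None
```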
* Fix test_manifest unit tests to know about model 'batches' key
* Add some console output assertions to microbatch functional tests
* add batch_results: None to expected_run_results
* Add changie doc for microbatch retry functionality
* maintain protoc version 5.26.1
* Cleanup extraneous comment in LogModelResult
---------
Co-authored-by: Michelle Ark <michelle.ark@dbtlabs.com>
* Test case for `merge_exclude_columns`
* Update expected output for `merge_exclude_columns`
* Skip TestMergeExcludeColumns test
* Enable this test since PostgreSQL 15+ is available in CI now
* Undo modification to expected output
* Remove duplicated constructor for `ResourceTypeSelector`
* Add type annotation for `ResourceTypeSelector`
* Standardize on constructor for `ResourceTypeSelector` where `include_empty_nodes=True`
* Changelog entry