Skip to content

Commit fa7f508

Browse files
committed
Release 5.0 - Welcome Marketing! ๐Ÿ“บ
1 parent 19a36e0 commit fa7f508

File tree

172 files changed

+14914
-3008
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

172 files changed

+14914
-3008
lines changed

โ€Ž.gitmodules

+4
Original file line numberDiff line numberDiff line change
@@ -14,3 +14,7 @@
1414
path = src/SFDC
1515
url = ../cortex-salesforce
1616
branch = main
17+
[submodule "src/marketing"]
18+
path = src/marketing
19+
url = ../cortex-marketing
20+
branch = main

โ€Ž1_click.sh

+26
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
#!/bin/bash
2+
#
3+
# Copyright 2023 Google LLC
4+
#
5+
# Licensed under the Apache License, Version 2.0 (the "License");
6+
# you may not use this file except in compliance with the License.
7+
# You may obtain a copy of the License at
8+
#
9+
# https://www.apache.org/licenses/LICENSE-2.0
10+
#
11+
# Unless required by applicable law or agreed to in writing, software
12+
# distributed under the License is distributed on an "AS IS" BASIS,
13+
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
14+
# See the License for the specific language governing permissions and
15+
# limitations under the License.
16+
#
17+
# 1-Click Deployment Launcher
18+
19+
SCRIPT_DIR=$( cd -- "$( dirname -- "${BASH_SOURCE[0]}" )" &> /dev/null && pwd )
20+
21+
pushd "${SCRIPT_DIR}" 1> /dev/null
22+
echo -n "Please wait..."
23+
git submodule update --init --recursive &> /dev/null || true
24+
echo -e -n "\r \r"
25+
src/utils/interactive/interactive.sh "${1}" "${2}"
26+
popd 1> /dev/null

โ€ŽREADME.md

+562-497
Large diffs are not rendered by default.

โ€ŽRELEASE_NOTES.md

+29-3
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,37 @@
1+
## July 2023 - Release 5.0
2+
* **New Marketing models:** A new repository, [Cortex for Marketing](https://github.com/GoogleCloudPlatform/cortex-marketing), has been added to the Data Foundation. This repository starts with data ingestion and data processing DAGs for Cloud Composer and Dataflow and predefined data models for Google Ads and Campaign Manager 360. This accelerates Ads reporting scenarios like keyword performance insights across campaigns and audience insights across display campaigns directly in BigQuery. Please check the [ERDs in the docs](https://github.com/GoogleCloudPlatform/cortex-data-foundation/tree/main/docs) folder.
3+
* **Quick demo deployment**: For those looking for a frictionless demo deployment experience, we have created a button that will guide them through an automated process to create sample datasets with test data and enable APIs and permissions. This is available in the [README](https://github.com/GoogleCloudPlatform/cortex-data-foundation/tree/main#quick-demo-setup).
4+
* **Cross-workload and source reusable models** (a.k.a, **K9**, where the DAGs ๐Ÿถ live): Reusable models, such as time dimensions or external sources like Weather, are now available through a deployment mechanism that is shared across all datasets. This allows for cross workload reporting, like joining SAP and Google Ads data too. Some DAGs that used to be deployed as workload-specific, like holiday calendar to SAP, is not migrated to the dataset Please check the migration guide in [the docs folder](https://github.com/GoogleCloudPlatform/cortex-data-foundation/tree/main/docs/external_dag_migration). Weather and Trends extraction DAGs are now disabled by default. More information about the K9 module in the [README](https://github.com/GoogleCloudPlatform/cortex-data-foundation/tree/main#configure-k9-deployments).
5+
* [Optional materialization and performance optimization](https://github.com/GoogleCloudPlatform/cortex-data-foundation/tree/main#performance-optimization-for-reporting-views) functionality is enabled for all modules and views. This impacts the SQL templates previously containing a `CREATE` statement. ๐Ÿšจ๐Ÿšจ We strongly recommend checking the new default configurations as this will attempt to replace views with tables. ๐Ÿšจ๐Ÿšจ You will need to delete the existing views in Reporting before you can deploy a table with the same name. See the documentation for more details.
6+
* [CATGAP](https://github.com/GoogleCloudPlatform/cortex-data-foundation/tree/main/src/k9/src/catgap) has moved to K9. By default, the deployment is disabled.
7+
* The test harness data has now moved from buckets into BigQuery datasets available in all relevant regions. Test harness data is still not provided with any warranty of quality, but this change simplifies and speeds up the deployment for those exploring the framework.
8+
* Substitutions are removed, except for Logs GCS bucket. Configuration files are now normalized into `config.json`. Legacy Dotenv files have been removed. Check [the updated documentation](https://github.com/GoogleCloudPlatform/cortex-data-foundation/tree/main#configure-deployment) to understand parameters.
9+
* deployCDC is now specific to each workload
10+
* generateExtraData is now removed, the control of this deployment is owned by K9.
11+
### **SAP**
12+
- SAP-specific DAGs, like currency conversion, inventory management or hierarchy reader, were moved to the reporting module and are executed by default. This prevents downstream errors that used to happen when _GEN_EXT was flagged false.
13+
- Please check the [reporting configuration for materialization](https://github.com/GoogleCloudPlatform/cortex-reporting/blob/main/reporting_settings_ecc.yaml), these have been modified to better match customer expectations but may not apply to your business at all.
14+
* UNION deployment logic has been removed, but supported views will continue to have the UNION template until further notice.
15+
- The reporting module relies on the time dimension deployment in K9, it can be executed manually if deploying from the [cortex-reporting submodule](https://github.com/GoogleCloudPlatform/cortex-reporting/).
16+
17+
### **Salesforce**
18+
- New currency conversion for Salesforce. Since these are optional components for Salesforce deployments, they can be created through the placeholder settings. If you don't have the currency types and conversion in Salesforce, comment out or remove the calls in `SFDC/config/ingestion_settings.yaml`. The configuration in `cdc_placeholder_settings.yaml` will take care of the dependencies in the reporting views.
19+
- The reporting module relies on the time dimension deployment in K9, it can be executed manually if deploying from the [Salesforce submodule](https://github.com/GoogleCloudPlatform/cortex-salesforce).
20+
21+
### Known issues and limitations
22+
- SAP Reporting and Salesforce Reporting requires execution of K9 pre-processing if used as a separate submodule.
23+
- ML Models repository still allows for substitutions.
24+
- CM360 and Ads integration scripts are in `Experimental` mode while we gather early feedback from customers on usage. We would like to hear what you think through GitHub issues.
25+
- Google Ads DAG is supported on Airflow v2 only.
26+
127
## March 2023 - Release 4.2
228
* **New Supply Chain and Finance views**: These views support KPIs for Inventory management, Vendor Performance, Accounts Payable and Spend Analysis. Please check the updated [ECC ERD](https://github.com/GoogleCloudPlatform/cortex-data-foundation/blob/main/docs/erd_ecc.pdf) and the [S/4 ERD](https://github.com/GoogleCloudPlatform/cortex-data-foundation/blob/main/docs/erd_s4.pdf) The new objects are:
329
- **Reusable DAG scripts for inventory snapshots**: ([SAP_CDC/external_dag/inventory_snapshots](https://github.com/GoogleCloudPlatform/cortex-dag-generator/tree/main/src/external_dag/inventory_snapshots)): Initializer and periodic update for weekly and monthly inventory snapshots and aggregations, slow moving threshold, and stock characteristic configuration. Please check for CORTEX-CUSTOMER comments for potential updates needed for custom thresholds and material movements. These scripts are in PREVIEW and will be moved into another structure in deployment in the next major release. โš ๏ธโš ๏ธ These scripts fill the base tables and need to be scheduled for the reporting views to function properly.โš ๏ธโš ๏ธ
430
- **New views for reporting**: AccountsPayableOverview, DaysPayableOutstanding, InventoryByPlant, InventoryKeyMetrics, MaterialLedger, MaterialMovementTypesMD
531
MaterialsBatchMD, POScheduleLine, PurchaseDocumentsHistory, SalesOrderStatus, SlowMovingThreshold, StockCharacteristicsConfig, StockInHand, StockMonthlySnapshots, StockWeeklySnapshots.
632
- **Overview views**: VendorLeadTimeOverview, VendorPerformanceOverview. These views show the reporting logic used in Looker in case you want to replicate them in another tool or a microservice. These views are not deployed by default.
733
* **Materializer PREVIEW** ๐Ÿซถ: By default, the new views that will require a lot of computation are now deployed as materializing DAGs. This helps improve performance and reduce costs and is configurable. This configuration is optional and the generated SQL can be ported into a scheduler of choice if you are not using Cloud Composer or Airflow. The next major release will follow this deployment approach for all views. See the [documentation](https://github.com/GoogleCloudPlatform/cortex-data-foundation/blob/main/README.md#optional-sap-only_performance-optimization-for-reporting-views) for more details.
8-
* **Cortex Analytics Templates - Google Ads Pipelines (CATGAP)** ๐Ÿˆ: This new experimental feature uses Natural Language Processing machine learning models to intelligently map product categories from Google Ads to SAP's product hierarchy. We'd love to know what you think. CATGAP is not deployed by default. Please check [the documentation](https://github.com/GoogleCloudPlatform/cortex-dag-generator/tree/main/src/external_dag/catgap/README.md) for details and further setup.
34+
* **Cortex Analytics Templates - Google Ads Pipelines (CATGAP)** ๐Ÿˆ: This new experimental feature uses Natural Language Processing machine learning models to intelligently map product categories from Google Ads to SAP's product hierarchy. We'd love to know what you think. CATGAP is not deployed by default.Please check [the documentation](https://github.com/GoogleCloudPlatform/cortex-dag-generator/tree/main/src/external_dag/catgap/README.md) for details and further setup.
935
* โš ๏ธโš ๏ธNOTEโš ๏ธโš ๏ธ Reporting views that expect parameters from currencies in config/config.json will produce the same result as many times as currencies are set as targets. Currency conversion in newer views is no longer commented out for convenience. However, the target currency needs to be passed as a filter from the reporting view. ๐Ÿ™๐Ÿ™ Please check for `CORTEX-CUSTOMER` comments for specific guidance if you deploy the data foundation with more than one currency.๐Ÿ™๐Ÿ™
1036
* Parameters that control runtime (e.g, "DEPLOY_SAP", "DEPLOY_SFDC", "DEPLOY_CDC" and "TEST_DATA") are now also read from the config file. These are still defaulted in cloudbuild.yaml substitutions. If you want to use the values in the file, the [substitutions section in cloudbuild.yaml](https://github.com/GoogleCloudPlatform/cortex-data-foundation/blob/main/cloudbuild.yaml#L107) needs to be commented out. Substitutions from the command line will be phased out.
1137
* Compatibility for Airflow v1 and v2 updated for currency_conversion DAG.
@@ -36,7 +62,7 @@
3662
* Salesforce integration DAGs have been tested in Airflow 1.0. Airflow 2.0 may require library updates or use of backwards compatible libraries to be tested and confirmed in the next release.
3763
* Finance views for SAP are good candidates for partial or total materialization. Check BigQuery's execution details to identify opportunities to create materialization processes and further optimizations that fit your data best.
3864
## November 2022 - Release 3.1
39-
* **New partitioning and clustering configuration for CDC deployment:** Configurable partitioning and clustering on deployment of CDC landing tables and scripts. See example in [setting.yaml](https://github.com/GoogleCloudPlatform/cortex-dag-generator/blob/main/setting.yaml) and [the README instructions](https://github.com/GoogleCloudPlatform/cortex-data-foundation#performance-optimization-for-cdc-tables).
65+
* **New partitioning and clustering configuration for CDC deployment:** Configurable partitioning and clustering on deployment of CDC landing tables and scripts. See example in [cdc_settings.yaml](https://github.com/GoogleCloudPlatform/cortex-dag-generator/blob/main/cdc_settings.yaml) and [the README instructions](https://github.com/GoogleCloudPlatform/cortex-data-foundation#performance-optimization-for-cdc-tables).
4066
* **New date dimension ๐Ÿ“…** generated from external DAGs through `_GEN_EXT=true` to allow for more flexibility in reporting. This table has been incorporated into views POSchedule, MaterialsValuation, PurchaseDocuments, Deliveries, Billing, SalesOrderScheduleLine, AccountingDocuments, InvoiceDocuments_Flow, POOrderHistory and SalesOrders_V2. The straucture of the table will be generated without data if `_GEN_EXT=false`.
4167
**Note:** If you do not want to execute the generation of other DAGs for external sources, remove them from the list in the [generation script](https://github.com/GoogleCloudPlatform/cortex-dag-generator/blob/main/generate_external_dags.sh#L4).
4268
* Performance improvements to AccountingDocuments, InvoiceDocuments_Flow,Billing, Deliveries, MaterialsValuation, PurchaseDocuments and POOrderHistory, SalesOrders_V2 when using currency conversion generated from DAG materializing results (see CORTEX-CUSTOMER tags and differences when merging).
@@ -136,7 +162,7 @@ This release brings the following changes Cortex Data Foundation.
136162

137163

138164
**Bug Fixes:**
139-
* Fix for Runtime views not generated from `setting.yaml`
165+
* Fix for Runtime views not generated from `settings.yaml`
140166
* Deployment dependencies causing errors on view creation
141167

142168

0 commit comments

Comments
ย (0)