Qlik Talend Cloud (QTC) recently introduced an important feature: Cross Pipeline Projects. This addition significantly improves how data engineers build and share pipeline architectures, making data initiatives more efficient and collaborative.
This practical guide will show you:
Before Cross Pipeline Projects, QTC users faced a significant constraint: data processed in one project couldn't be easily used in another without re-storing and re-registering that data. This created unnecessary duplication and made collaboration between data engineering teams difficult.
For organisations with multiple data sources and teams, this limitation forced inefficient workflows and slowed development.
Let's explore a straightforward implementation connecting Github data to Snowflake using QTC's Cross Pipeline Projects feature.
Prerequisites
For this example, I've already:
There are 2 ways to land data using QTC:
For ingest, you would typically use a replicate project, but this method doesn't allow us to use the output in other QTC projects without re-storing & re-registering the data. So we will be using a pipeline project for this example.
The initial project focuses on extracting and landing data:
The 'Onboarding Task' is where Cross Pipeline Projects truly shows its value:
The onboarding process automatically creates several Snowflake objects:
------------------------------------------------------------------------------------
Here's where the practical benefit becomes clear. In a separate project:
The transform output within the OMETIS_PIPELINES database in the GITHUB schema produces:
2 Tables (because the author chose to materialize in the QTC settings)
3 Views (the main output view, as well as changes and with deleted records)
As shown in the screenshot, this creates:
GITHUB_TRANSFORM_1_ct and GITHUB_TRANSFORM_1_current tables
GITHUB_TRANSFORM_1, GITHUB_TRANSFORM_1_changes, and GITHUB_TRANSFORM_1_whdr views
This Cross Pipeline Projects architecture provides several benefits:
After implementation, your Snowflake instance will contain (as shown in the screenshot):
In the OMETIS_LANDING database with GITHUB schema:
In the OMETIS_PIPELINES database with GITHUB schema:
This structured approach creates a clean data flow from ingestion through transformation, with no redundant data registration steps.
While our example uses Github data, this pattern works well for various data sources:
This approach scales well with more data sources and more complex transformations without losing performance or clarity.
Qlik Talend Cloud's Cross Pipeline Projects feature represents an important improvement in how data engineers can collaborate. By eliminating the need to re-register data between projects, QTC now supports more efficient data architectures.
For organisations with multiple data teams, this feature enables better collaboration, clearer data lineage, and more manageable data pipelines.
Need help implementing Cross Pipeline Projects in your Qlik environment?
Book a call with our Qlik integration specialists to discuss how we can help you leverage this and other QTC features for your specific business needs.