
Azure Data Factory is a cloud-based data integration service that simplifies the process of integrating data from various sources.
With Azure Data Factory, you can create data pipelines that move data between different systems, applications, and services.
This allows for real-time data integration and analytics, making it easier to make informed business decisions.
Azure Data Factory also supports a wide range of data sources, including Azure Blob Storage, Azure Data Lake Storage, and many more.
By using Azure Data Factory, you can automate data integration tasks and reduce the risk of human error.
Azure Data Factory
Azure Data Factory is a powerful tool for building cloud-native data engineering pipelines. It has a lot of source connectors available, and this list is overgrowing.
Azure Data Factory uses Integration Runtimes as a computing infrastructure to execute data movement and transformation activities. There are three types of Integration Runtimes in ADF: Azure, Self-Hosted, and Azure-SSIS.
The Azure Integration Runtime is the default one that Azure fully manages, suitable for connecting to Azure resources. The Self-Hosted Integration Runtime can connect to on-prem sources and secure data movement between systems. Azure-SSIS Integration Runtime is used for running SSIS packages in Azure Data Factory.
To set up a Self-Hosted Integration Runtime, you need an Azure subscription and a resource group with Azure Data Factory. You also need to set up a Self-Hosted Integration Runtime in Azure Data Factory.
Here are the steps to share a Self-Hosted Integration Runtime:
1. Select the integration runtime to share.
2. Click "Grant Permission to another Data Factory or user-assigned managed identity".
3. Grant access to the target Azure Data Factory.
4. Click "New" and select "Azure, self-hosted".
5. Under "External Resources", select "Linked Self-Hosted" and click "Continue".
By following these steps, you can share a Self-Hosted Integration Runtime between multiple Azure Data Factory instances.
Integration Runtime
Integration Runtime is a crucial component in Azure Data Factory, and it's essential to understand the different types and how they work. There are three types of Integration Runtime in ADF: Azure integration runtime, self-hosted integration runtime, and Azure-SSIS integration runtime.
Azure integration runtime is the default one that Azure fully manages, suitable for connecting to Azure resources, and data movement is in the public network. Self-hosted integration runtime, on the other hand, can connect to on-prem sources and secure data movement between systems, supporting data movement in the private network. Azure-SSIS integration runtime is used for running SSIS packages in Azure Data Factory.
To set up a self-hosted integration runtime, you need to install a software on standalone nodes within your private network. This runtime can be configured with a service key for authentication with ADF, and you can install it on multiple servers for active-active scalability. With self-hosted integration runtime, you don't have to worry about network configuration, unlike with Azure Integration Runtime.
Here are the three types of Integration Runtime in ADF:
To share a self-hosted integration runtime with another Azure Data Factory, you need to select the integration runtime, click edit, copy the ResourceID, and grant permission to the target Azure Data Factory.
Frequently Asked Questions
What does shir stand for?
SHIR stands for Self-Hosted Integration Runtime, a virtual machine that enables data integration and processing in Azure Data Factory and Azure Synapse. Learn how to monitor and manage SHIR virtual machines for optimal performance.
Sources
- https://community.databricks.com/t5/data-engineering/databricks-sql-endpoint-as-linked-service-in-azure-data-factory/td-p/71477
- https://www.snowflake.com/en/blog/snowflake-connector-for-azure-data-factory-adf/
- https://www.madeiradata.com/post/azure-data-factory-self-hosted-integration-runtime-what-is-it-and-when-to-use-it
- https://azureops.org/articles/shared-integration-runtime-in-azure-data-factory/
- https://key2consulting.com/azure-integration-runtime-azure-data-factory/
Featured Images: pexels.com