Skip to main content
Version: Next

Microsoft Fabric

Certified

Azure Fabric source for DataHub metadata ingestion

CLI based Ingestion

Config Details

Note that a . is used to denote nested fields in the YAML recipe.

FieldDescription
batch_size
integer
Default: 100
check_timeout_sec
integer
Default: 120
enable_dataflow_extraction
boolean
Default: True
enable_dataset_discovery
boolean
Default: True
enable_job_extraction
boolean
Default: True
enable_lineage_discovery
boolean
Default: True
enable_profiling
boolean
Default: False
enable_usage_stats
boolean
Default: True
incremental_lineage
boolean
Default: True
platform_instance
string
The instance of the platform that all assets produced by this recipe belong to. This should be unique within the platform. See https://datahubproject.io/docs/platform-instances/ for more details.
profiling_sample_size
integer
Default: 1000
retry_count
integer
Default: 3
workspace_name
string
Workspace name
Default:
workspace_url
string
Workspace URL
Default:
env
string
The environment that all assets produced by this connector belong to
Default: PROD
azure_config
AzureConnectionConfig
Azure configuration
azure_config.account_key
string
Azure storage account access key.
azure_config.account_name
string
Name of the Azure storage account. See Microsoft official documentation on how to create a storage account.
azure_config.base_path
string
Base folder in hierarchical namespaces to start from.
Default: /
azure_config.client_id
string
Azure client (Application) ID for service principal auth.
azure_config.client_secret
string
Azure client secret for service principal auth.
azure_config.container_name
string
Azure storage account container name.
azure_config.sas_token
string
Azure storage account SAS token.
azure_config.tenant_id
string
Azure tenant ID required for service principal auth.
azure_config.use_cli_auth
boolean
Whether to authenticate using the Azure CLI.
Default: False
azure_config.use_managed_identity
boolean
Whether to use Azure Managed Identity authentication.
Default: False
stateful_ingestion
StatefulIngestionConfig
Stateful Ingestion Config
stateful_ingestion.enabled
boolean
Whether or not to enable stateful ingest. Default: True if a pipeline_name is set and either a datahub-rest sink or datahub_api is specified, otherwise False
Default: False

Code Coordinates

  • Class Name: datahub.ingestion.source.ms_fabric.source.AzureFabricSource
  • Browse on GitHub

Questions

If you've got any questions on configuring ingestion for Microsoft Fabric, feel free to ping us on our Slack.