Cloud Data Fusion API

Name: Cloud Data Fusion API API
Brand: Cloud Data Fusion API
Availability: InStock

✓ Official Vendor SpecAnalyticsData Pipelinesoauth215 EndpointsREST

For Agents

Provision and lifecycle Cloud Data Fusion instances, manage DNS peerings, and list available versions. Lets agents stand up the data integration platform without going through the Cloud Console.

Quickstart

Get started with Cloud Data Fusion API in minutes using your preferred integration method.

# Add to your MCP client config (Claude Desktop, Cursor, Windsurf)
{
  "jentic": {
    "url": "https://api.jentic.com/mcp",
    "auth": "oauth"
  }
}

# Then ask your agent:
"create a cloud data fusion instance"

# → Jentic returns the GET /events tool with parameter schema, agent executes.

Capabilities

What an agent can do with Cloud Data Fusion API API.

Provision Cloud Data Fusion instances in a chosen region and edition

Restart, upgrade, and delete Data Fusion instances through the API

Configure DNS peerings so instances can resolve private network services

List available Data Fusion versions before creating an instance

GET STARTED

Start building with Cloud Data Fusion API API

Explore with Jentic

View OpenAPI Document

Use for: I need to provision a new Cloud Data Fusion instance in europe-west1, Restart an existing Data Fusion instance after a config change, List available Data Fusion versions for the Enterprise edition, Set up a DNS peering so my Data Fusion instance can reach an on-prem name server

Not supported: Does not design pipelines, run pipeline executions, or expose CDAP plugins — use for Data Fusion instance provisioning, lifecycle, and DNS peering only.

Use Cases

Patterns agents use Cloud Data Fusion API API for, with concrete tasks.

★ Programmatic Data Fusion Instance Provisioning

Stand up Cloud Data Fusion instances from infrastructure-as-code or an agent workflow rather than the Cloud Console. The API accepts edition, version, and network settings on create and exposes long-running operations to wait for the instance to reach the RUNNING state. End-to-end provisioning typically completes in 15-25 minutes once the create call returns.

Create a Cloud Data Fusion instance named 'analytics-prod' in us-central1 with edition ENTERPRISE and the latest available version, then poll the operation until state is RUNNING

Private Network DNS Peering Setup

Configure DNS peerings on a Data Fusion instance so pipelines running inside the tenant project can resolve private hostnames in the customer VPC. Without the peering, pipelines fail to reach on-prem databases, internal Cloud SQL endpoints, or private DNS zones. The API handles create, list, get, update, and delete on the dnsPeerings collection.

Create a DNS peering on instance 'analytics-prod' for domain 'corp.internal.' targeting the customer VPC's private DNS zone

Instance Lifecycle Automation in CI/CD

Spin up ephemeral Data Fusion instances for staging tests, run pipeline validation jobs, and tear them down to control cost. The API's restart, upgrade, and delete operations let CI pipelines drive the full lifecycle from a build job. Combined with version listing, this supports automated upgrade-test-promote flows.

Upgrade Data Fusion instance 'staging' to version 6.10.0 and wait for the operation to complete, then restart the instance and confirm state is RUNNING

AI Agent Provisioning Operator

An AI agent can provision and lifecycle Data Fusion instances through Jentic without the operator writing OAuth code. Jentic's intent search returns the matching create, restart, upgrade, or delete operation, the agent loads the schema, and Jentic executes against the Data Fusion control plane with vault-stored credentials. This compresses platform-engineering setup work that normally takes a developer multiple days.

Use Jentic to search 'create a data fusion instance', load the create-instance schema, and execute it for project 'acme-data', region 'us-central1', edition 'BASIC'

Key Endpoints

15 endpoints — cloud data fusion is a fully-managed, cloud-native data integration service for building and operating etl/elt pipelines through a graphical pipeline designer.

METHOD

PATH

DESCRIPTION

POST

/v1/{+parent}/instances

Create a Data Fusion instance

GET

/v1/{+parent}/instances

List Data Fusion instances in a location

POST

/v1/{+name}:restart

Restart a Data Fusion instance

POST

/v1/{+parent}/dnsPeerings

Create a DNS peering on an instance

GET

/v1/{+parent}/versions

List available Data Fusion versions

POST

/v1/{+name}:cancel

Cancel a long-running operation

POST

/v1/{+parent}/instances

Create a Data Fusion instance

GET

/v1/{+parent}/instances

List Data Fusion instances in a location

POST

/v1/{+name}:restart

Restart a Data Fusion instance

POST

/v1/{+parent}/dnsPeerings

Create a DNS peering on an instance

GET

/v1/{+parent}/versions

List available Data Fusion versions

Why though Jentic?

Three things that make agents converge on Jentic-routed access.

Credential isolation

Google OAuth client secrets and refresh tokens are stored encrypted in the Jentic vault. Agents receive scoped, short-lived access tokens for datafusion.googleapis.com; raw client credentials never enter the agent context.

Intent-based discovery

Agents search Jentic by intent (e.g. 'provision a data fusion instance') and Jentic returns the matching operation with its input schema, so the agent calls the right endpoint without browsing the discovery doc.

Time to first call

Direct Data Fusion integration: 1-3 days for OAuth, long-running-operation polling, and edition/version handling. Through Jentic: under 1 hour.

Related APIs

Alternatives and complements available in the Jentic catalogue.

Complementary

Dataflow API

Data Fusion pipelines compile to Dataflow jobs at runtime; Dataflow exposes the underlying job API.

Choose Dataflow when the agent needs to launch or monitor the actual Beam job. Use Data Fusion when the agent is provisioning the higher-level integration platform.

Alternative

Cloud Dataproc API

Dataproc runs managed Hadoop and Spark clusters; Data Fusion provides a graphical pipeline layer over comparable engines.

Choose Dataproc for code-first Spark or Hadoop workloads. Use Data Fusion when the workflow is built visually by analysts.

Complementary

Cloud Composer API

Composer (managed Airflow) often orchestrates Data Fusion instances and pipeline runs as part of a wider DAG.

Choose Composer when the agent needs cross-service orchestration. Use Data Fusion's API for instance lifecycle within that DAG.

FAQs

Specific to using Cloud Data Fusion API API through Jentic.

What authentication does the Cloud Data Fusion API use?

The Cloud Data Fusion API uses OAuth 2.0 with the cloud-platform scope. Through Jentic the OAuth client and refresh tokens live in the Jentic vault and the agent receives short-lived scoped access tokens, so raw Google credentials never enter the agent context.

Can I run pipelines through the Cloud Data Fusion API?

No — the Data Fusion control-plane API at datafusion.googleapis.com only provisions and lifecycles instances. Pipeline create, deploy, and run operations are exposed by the per-instance CDAP REST API, served from the running instance's URL once it is provisioned through this API.

What are the rate limits for the Cloud Data Fusion API?

Google enforces standard Cloud quotas on Data Fusion: per-project quotas on number of instances per edition, plus general read/write rate limits on the operations and dnsPeerings collections. Quotas are visible in the Cloud Console under IAM and admin, quotas, filtered to datafusion.googleapis.com.

How do I provision a Data Fusion instance through Jentic?

Search Jentic for 'create a data fusion instance', load the schema for POST /v1/{+parent}/instances, and execute with project, location, edition, and version. Jentic returns the long-running operation name; poll GET /v1/{+name} on it until done is true.

Is the Cloud Data Fusion API free?

API calls themselves are free, but Data Fusion instances are billed by edition (Developer, Basic, Enterprise) and runtime hours, plus underlying Dataproc and BigQuery costs incurred by pipelines. There is no per-API-call charge.

Which versions can I create an instance with?

Call GET /v1/{+parent}/versions on the location to list available Data Fusion versions before creating an instance. The response includes version IDs and default flag, so an agent can pick the latest stable version programmatically rather than hard-coding it.