AI & Machine Learning OAuth 2.0

Vertex AI REST API

Google Cloud's unified AI platform for ML model deployment

Vertex AI is Google Cloud's comprehensive machine learning platform that enables developers to build, deploy, and scale ML models using pre-trained APIs or custom models. It provides a unified environment for the entire ML workflow, from data preparation and model training to deployment and monitoring, with support for AutoML, custom training, and access to foundation models like PaLM 2 and Gemini.

Base URL https://us-central1-aiplatform.googleapis.com/v1

API Endpoints

Method	Endpoint	Description
POST	`/projects/{project}/locations/{location}/publishers/google/models/{model}:predict`	Generate predictions from a deployed model or foundation model
POST	`/projects/{project}/locations/{location}/publishers/google/models/{model}:streamGenerateContent`	Stream content generation from Gemini and PaLM models
POST	`/projects/{project}/locations/{location}/endpoints`	Create a new endpoint for model deployment
GET	`/projects/{project}/locations/{location}/endpoints/{endpoint}`	Get details about a specific endpoint
POST	`/projects/{project}/locations/{location}/models`	Upload and register a custom trained model
GET	`/projects/{project}/locations/{location}/models`	List all models in a project location
POST	`/projects/{project}/locations/{location}/datasets`	Create a new dataset for training or evaluation
GET	`/projects/{project}/locations/{location}/datasets/{dataset}`	Retrieve dataset metadata and statistics
POST	`/projects/{project}/locations/{location}/trainingPipelines`	Start a model training pipeline with AutoML or custom training
GET	`/projects/{project}/locations/{location}/trainingPipelines/{pipeline}`	Get training pipeline status and results
POST	`/projects/{project}/locations/{location}/batchPredictionJobs`	Create a batch prediction job for large-scale inference
POST	`/projects/{project}/locations/{location}/endpoints/{endpoint}:deployModel`	Deploy a model to an existing endpoint
POST	`/projects/{project}/locations/{location}/endpoints/{endpoint}:undeployModel`	Undeploy a model from an endpoint
GET	`/projects/{project}/locations/{location}/modelDeploymentMonitoringJobs`	List model monitoring jobs for deployed models
POST	`/projects/{project}/locations/{location}/featurestores`	Create a feature store for ML feature management

Code Examples

curl -X POST \
  https://us-central1-aiplatform.googleapis.com/v1/projects/my-project/locations/us-central1/publishers/google/models/gemini-pro:generateContent \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [{
      "role": "user",
      "parts": [{
        "text": "Explain quantum computing in simple terms"
      }]
    }],
    "generationConfig": {
      "temperature": 0.7,
      "maxOutputTokens": 1024
    }
  }'

const { GoogleAuth } = require('google-auth-library');

async function generateContent() {
  const auth = new GoogleAuth({
    scopes: 'https://www.googleapis.com/auth/cloud-platform'
  });
  const client = await auth.getClient();
  const projectId = 'my-project';
  const location = 'us-central1';
  const model = 'gemini-pro';
  
  const url = `https://${location}-aiplatform.googleapis.com/v1/projects/${projectId}/locations/${location}/publishers/google/models/${model}:generateContent`;
  
  const response = await client.request({
    url,
    method: 'POST',
    data: {
      contents: [{
        role: 'user',
        parts: [{ text: 'Explain quantum computing in simple terms' }]
      }],
      generationConfig: {
        temperature: 0.7,
        maxOutputTokens: 1024
      }
    }
  });
  
  console.log(response.data.candidates[0].content);
}

generateContent();

from google.cloud import aiplatform
from vertexai.preview.generative_models import GenerativeModel

aiplatform.init(project='my-project', location='us-central1')

model = GenerativeModel('gemini-pro')

response = model.generate_content(
    'Explain quantum computing in simple terms',
    generation_config={
        'temperature': 0.7,
        'max_output_tokens': 1024
    }
)

print(response.text)

Use Vertex AI from Claude / Cursor / ChatGPT

Get a hosted MCP endpoint for Vertex AI. Paste your Vertex AI API key, copy back one URL, drop it into Claude Desktop, Cursor, or any AI client that supports remote MCP. Your AI calls Vertex AI directly with your credentials — no local install, works on mobile.

generate_ai_content Generate text content using Gemini or PaLM foundation models with customizable parameters

deploy_ml_model Deploy a trained model to an endpoint for real-time predictions

batch_predict Run batch predictions on large datasets using deployed models

train_automl_model Start an AutoML training pipeline for tabular, image, text, or video data

monitor_model_performance Monitor deployed model performance, detect drift, and track prediction quality

Connect in 60 seconds

Paste your Vertex AI key → get an MCP URL → paste into Claude/Cursor. Hosted by IOX, encrypted at rest.

Connect Vertex AI to your AI →

Vertex AI REST API

API Endpoints

Sponsor this page

Code Examples

Use Vertex AI from Claude / Cursor / ChatGPT

Connect in 60 seconds

Related APIs