Getting started with the Claude API

What you’ll have at the end#

A working “hello, Claude” request from your laptop, using either Python or TypeScript. Plus a billing cap so a runaway script can’t drain your card.

Step 1 — Get an API key#

Sign in to platform.claude.com (Anthropic’s developer console — historically called console.anthropic.com, which still redirects).
Add a payment method (Settings → Billing → Add payment method). You can’t make requests without one. Anthropic gives small free credits for new accounts, but they expire.
Set a spend limit. Settings → Limits → set a monthly cap. Do this BEFORE the next step. If a script goes wrong, this is the line between “annoyance” and “expensive Sunday.”
Create a key. Settings → API Keys → Create Key. Name it for the project (claude-api-laptop-2026-05). Copy the sk-ant-... value — you’ll only see it once.

Set it in your shell:

# macOS / Linux — add to ~/.zshrc or ~/.bashrc
export ANTHROPIC_API_KEY=sk-ant-...

# Windows PowerShell — persistent
[Environment]::SetEnvironmentVariable("ANTHROPIC_API_KEY", "sk-ant-...", "User")

Restart your shell. Confirm it’s set without printing the secret — test -n "$ANTHROPIC_API_KEY" && echo "set" on macOS/Linux, or if ($env:ANTHROPIC_API_KEY) { "set" } in PowerShell. (Avoid echo $ANTHROPIC_API_KEY; that prints the live secret into your shell history and any screen-sharing window.)

⚠️ Never commit the key. Add .env* and *.secret to .gitignore. If a key leaks, revoke it on platform.claude.com immediately. Treat it like a database password.

Step 2 — Install an SDK#

Python#

pip install anthropic

TypeScript / Node#

npm install @anthropic-ai/sdk
# or pnpm add @anthropic-ai/sdk / bun add @anthropic-ai/sdk

Step 3 — Make your first call#

Python (sync)#

import anthropic

client = anthropic.Anthropic()  # picks up ANTHROPIC_API_KEY from env

message = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=200,
    messages=[
        {"role": "user", "content": "Reply with exactly: hello from the API"}
    ],
)

print(message.content[0].text)

Run it: python hello.py → you should see hello from the API (or close to it; the model rarely deviates from instructions this specific).

TypeScript#

import Anthropic from '@anthropic-ai/sdk';

const client = new Anthropic();  // picks up ANTHROPIC_API_KEY from env

const message = await client.messages.create({
  model: 'claude-sonnet-4-6',
  max_tokens: 200,
  messages: [
    { role: 'user', content: 'Reply with exactly: hello from the API' },
  ],
});

console.log(message.content[0].type === 'text' ? message.content[0].text : '');

Run it: tsx hello.ts → same output.

Step 4 — Read the response#

The message object has more than just text. The shape:

{
  "id": "msg_01ABC...",
  "model": "claude-sonnet-4-6-...",
  "role": "assistant",
  "stop_reason": "end_turn",
  "content": [
    { "type": "text", "text": "hello from the API" }
  ],
  "usage": {
    "input_tokens": 24,
    "output_tokens": 7
  }
}

Worth understanding:

content is an array of blocks. Most simple cases have one text block. With tool use, you’ll see tool_use blocks mixed in.
stop_reason tells you why the model stopped. end_turn (clean finish), max_tokens (you hit the limit), tool_use (model is asking you to run a tool), stop_sequence (custom stop sequence matched).
usage is your billing signal. Multiply input/output tokens × current pricing → cost of this call. Log it.

Step 5 — Add a system prompt#

message = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=200,
    system="You are a concise technical writer. No marketing language.",
    messages=[
        {"role": "user", "content": "Define MCP in two sentences."}
    ],
)

The system prompt sets the assistant’s persona / constraints for the whole conversation. Use it for “voice” instructions, output format rules, and “what not to do” guards.

Step 6 — Streaming (felt-latency win)#

with client.messages.stream(
    model="claude-sonnet-4-6",
    max_tokens=500,
    messages=[{"role": "user", "content": "Explain RAG."}],
) as stream:
    for text in stream.text_stream:
        print(text, end="", flush=True)

Tokens appear as they’re generated. For UIs, this dramatically improves perceived latency — readers see content start arriving immediately rather than staring at a spinner while the full response is composed.

Step 7 — Cost guardrails#

Before you put this in production:

Spend Limits set (Step 1.3 — but double-check)
Budget alerts — Console → Billing → Alert when monthly spend hits N% of cap
Per-request tracking — log message.usage somewhere queryable. A simple file or a real DB; either works
Retry logic with backoff — Anthropic rate-limits per-minute. Don’t hammer; the SDKs have built-in retry
Timeout — set timeout=30 on the client. Without it, a stuck connection waits forever

Common pitfalls#

Symptom	Cause	Fix
`AuthenticationError`	Key wrong, expired, or revoked	Check `echo $ANTHROPIC_API_KEY`; create a new key if needed
`BadRequestError: model not found`	Model name has changed or is region-restricted	Check `docs.anthropic.com/en/docs/about-claude/models` for current names
Empty response	`max_tokens` too low for the answer	Bump `max_tokens`; check `stop_reason`
Slow response	First call to a model after a long idle period (cold start)	Acceptable on first request; should not persist
429 rate limit	Too many concurrent requests	Add retry-with-backoff (SDK does this automatically by default)
Cost much higher than expected	Verbose responses, repeated system prompts (no caching)	Check usage logs; add prompt caching if system prompt is large

What to do next#

§API.3 Models — the current lineup and what each is good for
§API.4 Common patterns — tool use, structured outputs, prompt caching
§MCP.1 What is MCP — once you’ve built one app with tools, MCP is how you reuse them

`⌘` + `K` · `/`	open search
`j`	next entry (within section)
`k`	previous entry
`g` `h`	go to home
`g` `m`	go to methodology
`?`	show this help
`esc`	close any modal