🚀 [firebase_ai] Ability to stop a streaming request mid-flight

## What feature would you like to see?

A way to truly cancel an in-flight `sendMessageStream` / `generateContentStream` call so that the model stops generating on the server side — not just on the client.

Today, cancelling the `StreamSubscription` only stops the client from reading new chunks. The HTTPS connection stays open and the model keeps generating (and billing) tokens up to `maxOutputTokens`. That makes a real "Stop generating" button impossible in chat UIs: pressing Stop freezes the UI but doesn't actually save anything server-side.

## Why this matters

For a chat surface with tool-call chains (Gemini Flash, `maxOutputTokens: 8192`, recursive tool dispatcher), a single runaway prompt can burn 100k+ output tokens server-side after the user has visually "stopped" the chat. At scale that's real cost, and product-wise it means apps can't honestly ship a Stop button.

## Proposed minimal change

The root cause is that `package:http` itself doesn't yet support cancellation ([dart-lang/http#204](https://github.com/dart-lang/http/issues/204)) — that one is on the wider Dart team and will take time. **But firebase_ai's internal `Client` already accepts an `http.Client?`** ([client.dart:45](https://github.com/firebase/flutterfire/blob/main/packages/firebase_ai/firebase_ai/lib/src/client.dart#L45)), and so do `createGenerativeModel` and the internal `GenerativeModel` ctor. The public factory just doesn't pass it through:

```dart
// firebase_ai.dart
GenerativeModel generativeModel({
  required String model,
  // ... existing params ...
  http.Client? httpClient,   // 👈 add this
}) =>
    createGenerativeModel(
      // ... existing args ...
      httpClient: httpClient,
    );
```

That one-line plumbing change immediately unblocks the workaround: callers can inject a cancellable wrapper like [`cancellation_token_http`](https://pub.dev/packages/cancellation_token_http) and call `.close()` on cancel. Closing the socket signals Gemini to stop generating, so server-side billing actually stops.

No breaking changes; existing callers see no difference.

## Workarounds considered

1. Forking firebase_ai locally just to add the param — works but adds maintenance burden.
2. Using `package:google_generative_ai` — incompatible with Firebase App Check + Firebase auth.
3. Calling Vertex AI REST directly — loses tool-call schema mapping, function-call parsing, App Check token injection.

## Environment

- `firebase_ai: 3.11.0`
- Flutter 3.x (web + iOS + Android)
- Gemini 2.5 Flash via `FirebaseAI.googleAI()`



BTW, you rock 🤘🔥🪽

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🚀 [firebase_ai] Ability to stop a streaming request mid-flight #18309

What feature would you like to see?

Why this matters

Proposed minimal change

Workarounds considered

Environment

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

🚀 [firebase_ai] Ability to stop a streaming request mid-flight #18309

Description

What feature would you like to see?

Why this matters

Proposed minimal change

Workarounds considered

Environment

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions