OllamaModelBackend.generate_from_raw silently swallows batch exceptions

## `OllamaModelBackend.generate_from_raw` silently swallows batch exceptions

### Description

`generate_from_raw` uses `asyncio.gather(..., return_exceptions=True)` to run concurrent requests, but silently converts any exception into `ModelOutputThunk(value="")`, storing the error only in `result._generate_log.extra["error"]` — invisible to callers.

[ollama.py:465-474](https://github.com/generative-computing/mellea/blob/main/mellea/backends/ollama.py#L465-L474)

### Impact

Callers have no way to detect failures. Tests asserting `result.value is not None` pass silently even when requests are failing, since empty string is not None.

### Related

- #432 — streaming path exception handling (fixed by #580)
- #573 — test flakiness where this manifested


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OllamaModelBackend.generate_from_raw silently swallows batch exceptions #597

`OllamaModelBackend.generate_from_raw` silently swallows batch exceptions

Description

Impact

Related

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

OllamaModelBackend.generate_from_raw silently swallows batch exceptions #597

Description

OllamaModelBackend.generate_from_raw silently swallows batch exceptions

Description

Impact

Related

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

`OllamaModelBackend.generate_from_raw` silently swallows batch exceptions