bug: Amazon Q CLI fails for all model calls when a really large file is added to context #1254

Open

walmsles opened this issue Apr 16, 2025 · 2 comments · May be fixed by #1331
@walmsles

Operating system

MacOS (Sequoia 15.4)

Expected behaviour

Bug: Amazon Q CLI fails with ValidationException when large file is added to context

Description

When using Amazon Q CLI, adding a large file to the context causes all model inference operations to fail with a ValidationException error. The CLI commands that don't require model inference (like /context show) continue to work, but any attempt to chat with the model results in an error.

Expected behavior

Amazon Q should either:

  • Successfully process the file if it's within acceptable limits
  • Return a clear error message about token size limitations before attempting inference
  • Gracefully handle the oversized file without breaking model inference capabilities

Actual behavior

All requests requiring model inference fail with a ValidationException error after adding a large file to context. Commands that don't require model inference (like /context show) continue to work normally.

Context information

/context show

🌍 global:
.amazonq/rules/**/*.md
README.md
AmazonQ.md

👤 profile (default):
amazon_q/*.md ./t.md (1 match)

1 matched file in use:
👤 [~295900 tokens] /Users/m.walmsley/dev/open-source/test/./t.md

Total: ~295900 tokens

hello

Amazon Q is having trouble responding right now:
0: unhandled error (ValidationException)
1: service error
2: unhandled error (ValidationException)
3: Error { code: "ValidationException", message: "Improperly formed request.", aws_request_id: "a6641e95-548e-4f5a-a7a0-e8cd23e7c6e3" }

Location:
crates/q_cli/src/cli/chat/mod.rs:668

BACKTRACE
1: backtrace::backtrace::trace::h976c2bd252d3a769
at :
2: backtrace::capture::Backtrace::new::h61944a637b13bacd
at :
3: color_eyre::config::EyreHook::into_eyre_hook::{{closure}}::he363a1d13cffc841
at :
4: eyre::error::<impl core::convert::From for eyre::Report>::from::h24ff12ad7df51aa1
at :
5: q_cli::cli::chat::ChatContext::try_chat::{{closure}}::h61eb9def6e01d349
at :
6: q_cli::cli::chat::chat::{{closure}}::h75365372f3391e05
at :
7: q_cli::cli::Cli::execute::{{closure}}::h39c461beca91f57e
at :
8: q_cli::main::{{closure}}::h8cf890f0b810cff0
at :
9: tokio::runtime::scheduler::current_thread::CoreGuard::block_on::h69e1a9c9f94f69df
at :
10: tokio::runtime::context::runtime::enter_runtime::h309a865907b44753
at :
11: tokio::runtime::runtime::Runtime::block_on::h8f3a850c0b7753b1
at :
12: q_cli::main::ha217496dbe4ef1e3
at :
13: std::sys::backtrace::__rust_begin_short_backtrace::hc7cc976bfd4313cb
at :
14: std::rt::lang_start::{{closure}}::hd2ca6dad83ca0cb5
at :
15: std::rt::lang_start_internal::hacda2dedffd2edb4
at :
16: _main
at :

Run with COLORBT_SHOW_HIDDEN=1 environment variable to disable frame filtering.

Environment

  • Operating System: macOS
  • Amazon Q CLI version: 1.7.3
  • File size that triggered the issue: ~800KB
  • Token count that triggered the issue: ~296K tokens
  • Total context tokens: ~298,860 tokens

Additional context

This issue significantly impacts productivity as users cannot use large reference files in their workflows. The error message "Improperly formed request" is not helpful in diagnosing the actual issue, which appears to be related to context token limitations.

Possible solutions

  • Implement proper token size validation with clear error messages before attempting to process (see the sketch after this list)
  • Add graceful error handling that preserves model inference capabilities even when token limits are exceeded
  • Provide documentation on maximum context size limitations
  • Improve error messages to clearly indicate when the issue is related to context size
  • Consider implementing automatic file chunking, summarization, or truncation for large files
  • Add a warning when approaching token limits during context addition
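
A minimal sketch of what the first bullet could look like, assuming a roughly 150K-token budget and a crude characters-per-token heuristic; the constant names, the helper function, and the heuristic are illustrative assumptions, not the actual q_cli implementation.

use std::fs;
use std::path::Path;

// Assumed budget and heuristic for illustration only; neither value comes from q_cli.
const MAX_CONTEXT_TOKENS: usize = 150_000;
const CHARS_PER_TOKEN: usize = 3; // rough: the ~800KB file above counted as ~296K tokens

fn estimate_tokens(text: &str) -> usize {
    text.chars().count() / CHARS_PER_TOKEN
}

// Validate a file against the remaining budget before it is ever sent to the model,
// returning a clear, size-related error instead of a generic ValidationException.
fn check_context_file(path: &Path, used_tokens: usize) -> Result<usize, String> {
    let text = fs::read_to_string(path).map_err(|e| format!("{}: {e}", path.display()))?;
    let tokens = estimate_tokens(&text);
    if used_tokens + tokens > MAX_CONTEXT_TOKENS {
        Err(format!(
            "{} is ~{} tokens; adding it would exceed the ~{} token context limit ({} tokens already in use). Run /context show for details.",
            path.display(), tokens, MAX_CONTEXT_TOKENS, used_tokens
        ))
    } else {
        Ok(tokens)
    }
}

fn main() {
    match check_context_file(Path::new("t.md"), 2_960) {
        Ok(tokens) => println!("Added (~{tokens} tokens)"),
        Err(msg) => eprintln!("Warning: {msg}"),
    }
}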

Actual behaviour

[screenshot of the error output]

Steps to reproduce

  1. Install and configure Amazon Q CLI
  2. Start a new conversation with q chat
  3. Add a large file to the context using /context add <large-file> (in my case, a file of ~800KB containing ~296K tokens)
  4. Attempt to make any request that requires model inference (like a simple "hello")

Environment

[q-details]
version = "1.7.3"
hash = "3e4ae79d371315e80ddac772b43fff2cba314104"
date = "2025-04-10T05:48:16.435186Z (6d ago)"
variant = "full"

[system-info]
os = "macOS 15.4.0 (24E248)"
chip = "Apple M2 Max"
total-cores = 12
memory = "32.00 GB"

[environment]
cwd = "/Users/USER/dev/open-source/test"
cli-path = "/Users/USER/dev/open-source/test"
os = "Mac"
shell-path = "/bin/zsh"
shell-version = "5.9"
terminal = "iTerm 2"
install-method = "brew"

[env-vars]
PATH = "/Users/USER/tools:/Users/USER/bin:/Users/USER/.pyenv/shims:/Users/USER/.nvm/versions/node/v22.4.0/bin:/opt/homebrew/bin:/opt/homebrew/sbin:/usr/local/bin:/System/Cryptexes/App/usr/bin:/usr/bin:/bin:/usr/sbin:/sbin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/local/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/appleinternal/bin:/usr/local/go/bin:/Users/USER/tools:/Users/USER/bin:/Users/USER/.nvm/versions/node/v22.4.0/bin:/Applications/iTerm.app/Contents/Resources/utilities:/Users/USER/Library/Application Support/JetBrains/Toolbox/scripts:/Users/USER/.local/bin:/Users/USER/Library/Application Support/JetBrains/Toolbox/scripts"
QTERM_SESSION_ID = "ea9e223656664f2b850c8cfb45dd7b5c"
Q_SET_PARENT_CHECK = "1"
Q_TERM = "1.7.3"
SHELL = "/bin/zsh"
TERM = "xterm-256color"
__CFBundleIdentifier = "com.googlecode.iterm2"
GoodluckH self-assigned this Apr 18, 2025

@GoodluckH
Collaborator

Currently we have a 200k token limit; we can use 150k as the limit for context files.

We can add the following validations:

  1. Hard validation: upon adding new rules, we stop the user if the newly matched context files would exceed the limit.
  2. Soft validation: still allow users to add files, but intelligently drop the largest files when the user is interacting with the model, and prompt the user with "Some context files are skipped due to size limits, run /context show to learn more".
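
As a rough sketch of the soft-validation idea (not the actual q_cli code; the 150K budget, types, and function names are assumptions): keep every configured file, but greedily drop the largest ones right before inference and report what was skipped.

const MAX_CONTEXT_TOKENS: usize = 150_000; // assumed budget from the discussion above

#[derive(Debug)]
struct ContextFile {
    path: String,
    tokens: usize,
}

// Drop the largest files first until the remaining set fits the budget,
// returning (kept, skipped) so the user can be told what was left out.
fn drop_largest_until_fit(mut files: Vec<ContextFile>) -> (Vec<ContextFile>, Vec<ContextFile>) {
    files.sort_by(|a, b| b.tokens.cmp(&a.tokens)); // largest offenders first
    let mut total: usize = files.iter().map(|f| f.tokens).sum();
    let (mut kept, mut skipped) = (Vec::new(), Vec::new());
    for file in files {
        if total > MAX_CONTEXT_TOKENS {
            total -= file.tokens;
            skipped.push(file);
        } else {
            kept.push(file);
        }
    }
    (kept, skipped)
}

fn main() {
    let files = vec![
        ContextFile { path: "README.md".into(), tokens: 1_200 },
        ContextFile { path: "AmazonQ.md".into(), tokens: 1_700 },
        ContextFile { path: "t.md".into(), tokens: 295_900 },
    ];
    let (kept, skipped) = drop_largest_until_fit(files);
    if !skipped.is_empty() {
        println!("Some context files are skipped due to size limits, run /context show to learn more");
    }
    println!("kept: {kept:?}");
}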

@walmsles
Author

walmsles commented Apr 23, 2025

> Currently we have a 200k token limit; we can use 150k as the limit for context files.

I feel reaching this limit is unlikely, but it would be nice to fail gracefully all the same.

> We can add the following validations:
>
>   1. Hard validation: upon adding new rules, we stop the user if the newly matched context files would exceed the limit.

Given the globbing nature of the config, a warning when adding configuration pushes the scanned context over the threshold would be nice, including "run /context show to learn more" so you can see what is excluded.

>   2. Soft validation: still allow users to add files, but intelligently drop the largest files when the user is interacting with the model, and prompt the user with "Some context files are skipped due to size limits, run /context show to learn more".

I dislike the idea of "intelligently dropping the largest files"; dropping the last-found files in order of discovery makes more sense to me. It is a more declarative outcome, which I feel is critical here: that way I can tweak the context order in the profile and get the most important context in first, otherwise I am guessing at how to fix it.
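
To make that alternative concrete, a minimal sketch (same caveats: the budget and all names are assumptions, not q_cli's implementation) that keeps files strictly in the order they were discovered and skips any file that no longer fits, so reordering the profile directly controls what survives.

const MAX_CONTEXT_TOKENS: usize = 150_000; // assumed budget, as above

struct ContextFile {
    path: String,
    tokens: usize,
}

// Keep files in discovery order; any file that no longer fits within the
// remaining budget is skipped and reported, so earlier files always win.
fn keep_in_discovery_order(files: Vec<ContextFile>) -> (Vec<ContextFile>, Vec<ContextFile>) {
    let (mut kept, mut skipped) = (Vec::new(), Vec::new());
    let mut used = 0usize;
    for file in files {
        if used + file.tokens <= MAX_CONTEXT_TOKENS {
            used += file.tokens;
            kept.push(file);
        } else {
            skipped.push(file);
        }
    }
    (kept, skipped)
}

fn main() {
    // Discovery order mirrors the profile: global rules first, then profile globs.
    let files = vec![
        ContextFile { path: "README.md".into(), tokens: 1_200 },
        ContextFile { path: "AmazonQ.md".into(), tokens: 1_700 },
        ContextFile { path: "t.md".into(), tokens: 295_900 },
    ];
    let (kept, skipped) = keep_in_discovery_order(files);
    for f in &skipped {
        eprintln!("skipped {} (~{} tokens) due to context size limits", f.path, f.tokens);
    }
    println!("{} of {} files kept", kept.len(), kept.len() + skipped.len());
}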
