Commit Graph

78 Commits

Author SHA1 Message Date
root
d93a7a74d5 feat(api): add Llama 3.3 70B Instruct to AWS Bedrock model options 2025-01-23 21:12:59 +02:00
sam hoang
f745f080f4 feat: add explicit Azure OpenAI flag and setup memory bank docs
- Add openAiUseAzure flag to force Azure OpenAI client initialization
- Add "Use Azure" checkbox in API settings UI

This change improves Azure OpenAI configuration flexibility by allowing users to explicitly opt-in to Azure client, regardless of the base URL pattern.
2025-01-24 00:14:55 +07:00
Matt Rubens
82a6d13c20 Change default 2025-01-22 08:18:29 -08:00
Tom X Nguyen
93a571038f fix: update output length to 65536 2025-01-22 22:35:15 +07:00
Tom X Nguyen
5d5e69fe21 chore: add gemini flash thinking 01-21 2025-01-22 19:14:19 +07:00
sam hoang
d1a65f4de0 refactor: rename openAiCusModelInfo to openAiCustomModelInfo for better clarity
- Rename openAiCusModelInfo to openAiCustomModelInfo across all files for better readability
- Update related variable names and references to maintain consistency
- Affects OpenAI provider, ClineProvider, WebviewMessage, API interfaces, and UI components
2025-01-21 23:06:07 +07:00
sam hoang
d50e075c75 feat(openai): add custom model info configuration
Adds support for configuring custom OpenAI-compatible model capabilities and pricing, including:

Max output tokens
Context window size
Image/computer use support
Input/output token pricing
Cache read/write pricing
2025-01-21 22:46:21 +07:00
Matt Rubens
a13da25dd1 Deepseek fixes 2025-01-20 11:41:38 -05:00
philipnext
cab095aef2 add: deepseek-reasoner model info 2025-01-20 22:52:24 +09:00
Matt Rubens
60a0a824b9 Prettier backfill 2025-01-17 14:11:28 -05:00
Saoud Rizwan
077fa84374 Add Mistral API provider 2025-01-17 01:11:16 -05:00
Matt Rubens
c673edd526 Merge remote-tracking branch 'origin/main' into vscode-lm-provider 2025-01-15 01:37:37 -05:00
Matt Rubens
344c796f2e Chat modes 2025-01-12 13:11:52 -05:00
Matt Rubens
7137f8c528 Spacing adjustments 2025-01-08 17:28:32 -05:00
RaySinner
d5fd2bbb82 merge: resolve conflicts after upstream merge 2025-01-08 23:47:26 +03:00
Matt Rubens
0aa5f1fb85 Add the o1 model 2025-01-07 13:33:47 -05:00
RaySinner
98b9007c36 merge(upstream): merge upstream changes keeping VSCode LM provider and adding Glama support 2025-01-07 01:54:46 +03:00
RaySinner
9d62a7bb77 feat(vscode-lm): implement VS Code Language Models provider 2025-01-07 01:23:22 +03:00
Matt Rubens
7383a0514a Merge pull request #282 from RooVetGit/open_ai_streaming_toggle
Streaming checkbox for OpenAI-compatible providers
2025-01-05 22:46:50 -05:00
Matt Rubens
2cdfff02c0 Streaming checkbox for OpenAI-compatible providers 2025-01-05 20:34:54 -05:00
Frank
8b0e9bbf32 fix: remove unnecessary comment 2025-01-05 19:25:35 -05:00
Frank
e5e700ffcb feat: add Glama gateway 2025-01-04 22:50:12 -05:00
Matt Rubens
6290f90fa5 Refactor 2024-12-29 12:20:35 -08:00
Matt Rubens
eb8c4cc50f Add DeepSeek to the list of providers 2024-12-29 11:27:31 -08:00
student
ffd51479c1 Fix gemini-2.0-flash-thinking-exp-1219 contewWindow 2024-12-20 00:32:40 +01:00
Matt Rubens
38138164e5 Add gemini-2.0-flash-thinking-exp-1219 2024-12-19 15:32:30 -05:00
Premshay
acf472aae0 Update api.ts 2024-12-15 11:28:54 +02:00
Premshay
8460a796e9 Merge branch 'main' into main 2024-12-15 11:15:43 +02:00
Matt Rubens
6ee118e0a2 Merge pull request #86 from RooVetGit/fix-openai-compatible-streaming
Add 'Include stream options' checkbox for OpenAI-compatible providers
2024-12-13 01:41:15 -05:00
Matt Rubens
be3d8a6166 Incorporate MCP changes (#93)
Co-authored-by: Saoud Rizwan <7799382+saoudrizwan@users.noreply.github.com>
2024-12-12 23:16:39 -05:00
Matt Rubens
2cfd76c305 Add 'Include stream options' checkbox for OpenAI-compatible providers 2024-12-12 09:20:58 -05:00
Matt Rubens
156cab8cbc Fix: Add checkbox for stream_options in OpenAI-compatible providers 2024-12-12 08:58:38 -05:00
Cline
51a57d5bbf fix(bedrock): improve stream handling and type safety
- Fix TypeScript error in ConverseStreamCommand payload
- Add proper JSON parsing for test stream events
- Improve error handling with proper Error objects
- Add test-specific model info with required fields
- Fix cross-region inference and prompt cache config
2024-12-11 22:47:59 +02:00
Cline
140318cecd feat(api): unify Bedrock provider using Runtime API
Problem:
The current Bedrock implementation uses the Bedrock SDK, which requires separate handling for different model types and doesn't provide a unified streaming interface.

Solution:
Integrate the Bedrock Runtime API to provide a single, unified interface for all Bedrock models (Claude and Nova) using the ConverseStream API. This eliminates the need for separate handlers while maintaining all existing functionality.

Key Changes:
- Refactored AwsBedrockHandler to use @aws-sdk/client-bedrock-runtime
- Enhanced bedrock-converse-format.ts to handle all content types and properly transform between Anthropic and Bedrock formats
- Maintained cross-region inference support with proper region prefixing
- Added support for prompt caching configuration
- Improved AWS credentials handling to better support default providers
- Added proper error handling and token tracking for all response types

Dependencies:
- Added @aws-sdk/client-bedrock-runtime for unified API access
- Removed @anthropic-ai/bedrock-sdk dependency

Testing:
- Verified message format conversion for all content types
- Tested cross-region inference functionality
- Validated streaming responses for both Claude and Nova models

This change simplifies the codebase by providing a single, consistent interface for all Bedrock models while maintaining full compatibility with existing features.
2024-12-11 22:47:56 +02:00
Cline
dffc040e7c feat(bedrock): Add Meta Llama 3, 3.1, and 3.2 models with detailed pricing and context windows 2024-12-11 22:31:32 +02:00
Matt Rubens
0e0c848a96 Fix Gemini 2.0 context window 2024-12-11 14:02:54 -05:00
Matt Rubens
65bbefed55 Add Gemini 2.0 2024-12-11 13:36:46 -05:00
Domenico Bochicchio | Asclepyus Srl
39b51fa78f Added support for gemini-exp-1206 (#54) 2024-12-09 09:37:38 -05:00
John Stearns
423e2af520 Continuing work on support for OpenRouter compression (#43) 2024-12-07 09:38:13 -08:00
Saoud Rizwan
39bc35eec1 Add LM Studio provider 2024-11-12 22:02:42 -05:00
Saoud Rizwan
ad29ff2a03 Add AWS cross-region inference toggle 2024-11-07 13:51:13 -05:00
Sam
2eb11aadc7 Fix: Add missing sonnet model to bedrock (#633) 2024-11-07 12:57:03 -05:00
Saoud Rizwan
48cdfa9874 Add Haiku 3.5 to Anthropic, Bedrock, Vertex 2024-11-04 14:08:25 -05:00
Saoud Rizwan
17f34e6225 Add 'supports computer use' prop; sync latest openrouter info on launch 2024-10-23 15:43:34 -04:00
Saoud Rizwan
e5266ddda1 Add fallback claude 3.5 sonnet v1 models for bedrock and vertex 2024-10-22 18:24:34 -04:00
Saoud Rizwan
165272ab47 Update Anthropic model IDs 2024-10-22 16:21:17 -04:00
Saoud Rizwan
7dd9abda46 Cleanup 2024-10-09 01:49:59 -04:00
Saoud Rizwan
f6a14fdfb9 Add markdown rendering for model description; fixes 2024-10-09 01:49:57 -04:00
Saoud Rizwan
7cb0c524e5 Add OpenRouter custom model scheme 2024-10-09 01:49:57 -04:00
Saoud Rizwan
5069e1a11b Use new gemini models 2024-09-25 00:28:18 -04:00