Posts by Franck Dernoncourt
I see on https://huggingface.co/HuggingFaceTB/SmolLM2-135M-Instruct/tree/main/onnx: File Name Size model.onnx 654 MB model_fp16.onnx 327 MB model_q4.onnx 200 MB m...
I have several resource groups and instances in my Azure account. Yet, the Azure Command-Line Interface (CLI) lists no groups and no resources in my Azure account: franck [ ~ ]$ az group list [] ...
I follow Azure's tutorial on fine-tuning GPT. I'm stuck at the deployment phase. Code: # Deploy fine-tuned model import json import requests token = '[redacted]' subscription = '[redacted...
Does the Rate limit (Tokens per minute) of an Azure OpenAI instance correspond to the input tokens, the output tokens, or the sum of both? Crossposted at: https://serverfault.com/q/1176440/...
Example: I specify a random model name some random name: Then I call it: Which model is Cursor IDE using when I specify a random model name? Cross-posted: https://stackoverflow.com/q/79...
I use Azure OpenAI RAG-QA (aka "bring our data"): which I call via e.g.: import os import pprint from openai import AzureOpenAI #from azure.identity import DefaultAzureCredential, get_bear...
Reddit user InsufficientBorder found the issue: If you have multiple subscriptions, and/or haven't set a default, the CLI will "randomly" set one to be the subject of any requests. Check you'v...
Does the Rate limit (Tokens per minute) of an Azure OpenAI instance correspond to the input tokens, the output tokens, or the sum of both? The sum of the input tokens and the output tokens. S...
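To make the "sum of both" reading concrete, here is a minimal sketch of the bookkeeping it implies. It assumes only what the answer above states, namely that input and output tokens both count toward the tokens-per-minute limit. The function names and the numbers are hypothetical, chosen for illustration, and are not part of any Azure SDK; how the service actually meters requests may differ in detail.

```python
from typing import Iterable, Tuple


def tokens_counted(prompt_tokens: int, completion_tokens: int) -> int:
    """Tokens one request counts toward the limit, assuming input + output are summed."""
    return prompt_tokens + completion_tokens


def fits_in_minute(requests: Iterable[Tuple[int, int]], tpm_limit: int) -> bool:
    """Check whether a batch of (prompt_tokens, completion_tokens) pairs
    stays within a hypothetical tokens-per-minute limit."""
    total = sum(tokens_counted(p, c) for p, c in requests)
    return total <= tpm_limit


# Two requests sent within the same minute (illustrative numbers only).
batch = [(700, 300), (400, 100)]
print(fits_in_minute(batch, tpm_limit=1500))
```

Under this assumption, a request with a short prompt but a long completion uses up the budget just as much as the reverse, so both sides need to be included when estimating throughput.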