2025-05-22 15:59:51

danvoronov · danvoronov · commit bd764293e076 · 2025-05-22T15:59:51.000+03:00
diff --git a/eng_2025/05/2025-05-22-15-20.md b/eng_2025/05/2025-05-22-15-20.md
@@ -0,0 +1,21 @@
+I remember that back in the GPT-4 era, many custom models appeared that were specifically "tuned" for programming. There were even separate models for Python. Phind.com was doing cool stuff. Then it all somehow died down, and most general-purpose models became good at writing code anyway.
+
+https://windsurf.com/blog/windsurf-wave-9-swe-1
+Windsurf recently released their **SWE-1** models, but I think this is more of a move to cut external API costs.
+
+Mistral still provides API access to the closed **Codestral** model, last updated in January 2025.
+
+---
+
+And now we have a new turn: models are being tuned to work in the background, **independently coding a range of tasks** from a git repository. OpenAI has just re-released `Codex`, now based on `o3`. GitHub has updated its agent, adding a background work mode.
+
+https://mistral.ai/news/devstral
+Mistral's answer is the **Devstral model**, developed jointly with [All Hands AI](https://www.all-hands.dev/) (the team behind an open-source clone of the AI developer Devin). Unlike Codestral, it is licensed under Apache 2.0, which permits free use and modification. The model is also available via API under the name `devstral-small-2505`.
+
+What the model does better:
+- Parses large repositories
+- Finds connections between components
+- Scans code for errors
+- Solves real GitHub issues, which it was trained on
+
+According to All Hands AI 🙌, Devstral outperforms significantly larger models such as `DeepSeek-V3-0324 (671B)` and `Qwen3 235B-A22B`. At the same time, Devstral is light enough to **run on a single RTX 4090** or a Mac with 32 GB of RAM, which makes it a good fit for local background use.
diff --git a/ukr_2025/05/2025-05-22-15-20.md b/ukr_2025/05/2025-05-22-15-20.md
@@ -0,0 +1,21 @@
+Пам'ятаю, за часів GPT-4 з'являлося багато кастомних моделей, спеціально "заточених" під програмування. Були навіть окремі моделі під Python. Phind.com робив круті штуки. Далі це все якось стихло, більшість універсальних моделей і так стали добре писати код.
+
+https://windsurf.com/blog/windsurf-wave-9-swe-1
+Windsurf нещодавно випустили свої моделі **SWE-1**, але, я думаю, це радше крок, щоб скоротити витрати на зовнішні API.
+
+Компанія Mistral досі надає API-доступ до закритої моделі **Codestral**, востаннє оновленої в січні 2025.
+
+---
+
+І ось у нас новий виток: тепер моделі налаштовують на фонове **самостійне вирішення низки завдань** з git-репозиторію. OpenAI щойно перевипустили `Codex`, тепер заснувавши модель на `o3`. GitHub оновили агента, додавши функцію фонової роботи.
+
+https://mistral.ai/news/devstral
+Відповіддю Mistral стала **модель Devstral**, розроблена спільно з [All Hands AI](https://www.all-hands.dev/) (команда, що стоїть за опен-сорс клоном ШІ-розробника Devin). На відміну від Codestral, ліцензія тут Apache 2.0, тобто вільне використання та модифікація. Модель також доступна через API під назвою `devstral-small-2505`.
+
+Що краще робить модель:
+- розбирає великі репозиторії
+- знаходить зв'язки між компонентами
+- сканує код на помилки
+- розв'язує реальні проблеми з GitHub, на яких її навчено
+
+За даними All Hands AI 🙌, Devstral перевершує значно більші моделі, такі як `DeepSeek-V3-0324 (671B)` та `Qwen3 235B-A22B`. При цьому Devstral достатньо легка, щоб **працювати на одній RTX 4090** або Mac з 32 ГБ оперативної пам'яті, що робить її вдалим вибором для фонового локального використання.