Welcome to this week's edition of Overclocked!
This week, Deepseek OCR gives us a glimpse into what AI may look like in the future and why visual context compression could reset how models handle long documents. Then, we break down OpenAI’s latest acquisition and what it signals. Let’s dive in ⬇️
In today’s newsletter ↓
🧠 Deepseek OCR and the future of long context
🖥️ OpenAI buys Sky for Mac and what that means
📓 Turbo AI notetaker races to five million users
🌦️ Global weather body backs AI-powered forecasts and warnings
🎯 Weekly Challenge: Try Deepseek OCR
🐳 Deepseek OCR Might Be Another Game-Changer
Deepseek is pushing a fresh idea for long context. Instead of feeding walls of text into a model, it arranges the content into a high resolution image, then uses a decoder that can read from that image. You get fewer tokens to process while keeping structure that models usually struggle to hold over many pages.
Think contracts with dense schedules, research packs with charts and footnotes, or spreadsheets pasted into a slide. When the model sees the layout and not just a stream of characters, it can keep relationships that often vanish in plain text, like which number belongs to which clause or which footnote modifies which sentence.
🧰 What Changes for You
Every long task has a token tax. If you can send one or two visual frames instead of a pile of tokens, cost drops and speed improves. The biggest wins show up on mixed content, for example tables with small fonts and labels, price sheets with footnotes, or PDFs where text and images are tightly packed.
🚨 DeepSeek just did something wild.
They built an OCR system that compresses long text into vision tokens literally turning paragraphs into pixels.
Their model, DeepSeek-OCR, achieves 97% decoding precision at 10× compression and still manages 60% accuracy even at 20×. That
— #God of Prompt (#@godofprompt)
11:22 AM • Oct 20, 2025
Even simple reads feel more stable, since the image preserves where things live on the page, which helps the model reason about context rather than juggling a flattened stream.
🧪 How to Try It
Keep it practical. Pick one real document you already need this week. Pass a single clear page through an OCR to image pipeline, then ask a concrete question, such as list every date and amount, or find the late delivery risk clauses and return the exact lines.
Letsss gooo! DeepSeek just released a 3B OCR model on Hugging Face 🔥
Optimised to be token efficient AND scale ~200K+ pages/day on A100-40G
Same arch as DeepSeek VL2
Use it with Transformers, vLLM and more 🤗
— #Vaibhav (VB) Srivastav (#@reach_vb)
7:12 AM • Oct 20, 2025
Compare the result to your normal text only method. You are looking for three things, a faster answer, fewer misses on small but important details, and references that point to the right spots on the page.
🚀 What’s Next
Expect encoders that protect sensitive fields, decoders that get better at numbers and names, and workflows that store compressed visual context for common references, like vendor terms or house templates. If this approach lands in everyday tools, your long context strategy shifts.
You bring large archives into reach without constant trimming, you ask better questions over structured pages, and you stop wrestling with token limits that used to block real work.
🖥️ OpenAI Makes Another Big Acquisition
We’ve acquired Software Applications Incorporated, the maker of Sky—a natural language interface for Mac.
— #OpenAI Newsroom (#@OpenAINewsroom)
5:00 PM • Oct 23, 2025
OpenAI acquired the team behind Sky, a native Mac interface that lets an assistant understand what is on your screen and act inside apps with your permission. The plan is to fold the technology and the people into ChatGPT clients so the assistant can operate more naturally on the desktop.
The focus is system level awareness, clear prompts before actions, and a smoother way to take small steps like moving data between documents or clicking through a simple workflow.
🧩 Signals and Strategy
This leans into a desktop first future rather than a browser wrapper. A helpful assistant needs to know what is visible, where your cursor is, and which app holds the item you care about.
Sky was built for that kind of context. Pulling it into ChatGPT suggests deeper operating system hooks, along with clearer guardrails around what the assistant can and cannot do. You should expect more visible permission prompts, a slower and safer pace for actions, and a bias toward explain as it goes.
📍 Place in the Market
If the Sky approach lands inside ChatGPT on Mac, common chores get easier, like summarizing the visible section of a PDF, turning a selected table into a spreadsheet, or starting a calendar event from text on screen. Timeline and safety are the big questions.
The direction is clear, the assistant is moving closer to your apps, and you will approve each step as it happens.
The Weekly Scoop 🍦
🎯 Weekly Challenge: Try Deepseek OCR on Your Own
Challenge: Pick the easiest path for you and get one answer out of a real document today.
Here’s what to do:
🔗 Option 1: No-code demo
Upload one page from your PDF or a clear PNG.
Ask a concrete question like “list all dates and amounts.”
(Deepseek OCR is available to try on Hugging Face and on GitHub, per recent coverage.)
🔑Option 2: API quick test
Create a Deepseek account and copy your API key.
Use any OpenAI-compatible SDK and set the base URL to
https://api.deepseek.com.Send a single page image and ask your question. Keep it simple, then save the answer.
📄 Option 3: Local run
Visit the Deepseek-OCR repo on GitHub.
Follow the readme to run the sample script on a machine with a recent NVIDIA GPU.
👀 What to look for
Did it answer your question clearly on the first try
Was it faster than your normal method
Would you use it again for this type of file
That is it. One page, one question, one result.

That’s it for this week! Has Deepseek created yet another AI breakthrough to challenge major closed-source models? And, will OpenAI’s acquisitions result in better products or just more investment opportunities for the company? Hit reply and let us know your thoughts.
Zoe from Overclocked
