Blog

Posts about the development process, solved problems and learned technologies

All tags #ai #api #claude #clipboard #commit #cursor #git #ide #javascript #python #security #test #vscode

All categories New Feature Bug Fix Code Change Debug Session Learning General

Closing the CSRF Loophole in OAuth State Validation

I just shipped a critical security fix for Openclaw's OAuth integration, and let me tell you—this one was a *sneaky* vulnerability that could've been catastrophic. The issue lived in `parseOAuthCallbackInput()`, the function responsible for validating OAuth callbacks in the Chutes authentication flow. On the surface, it looked fine. The system generates a cryptographic state parameter (using `randomBytes(16).toString("hex")`), embeds it in the authorization URL, and checks it on callback. Classic CSRF protection, right? **Wrong.** Two separate bugs were conspiring to completely bypass this defense. First, the state extracted from the callback URL was never actually compared against the expected nonce. The function read the state, saw it existed, and just... moved on. It was validation theater—checking the box without actually validating anything. But here's where it gets worse. When URL parsing failed—which could happen if someone manually passed just an authorization code without the full callback URL—the catch block would **fabricate** a matching state using `expectedState`. Meaning the CSRF check always passed, no matter what an attacker sent. The attack scenario is straightforward and terrifying: A victim runs `openclaw login chutes --manual`. The system generates a cryptographic state and opens a browser with the authorization URL. An attacker, knowing how the manual flow works, could redirect the victim's callback or hijack the process, sending their own authorization code. Because the state validation was broken, the application would accept it, and the attacker could now authenticate as the victim. The fix was surgical but essential. I added proper state comparison—comparing the callback's state against the `expectedState` parameter using constant-time equality to prevent timing attacks. I also removed the fabrication logic in the error handler; now if URL parsing fails, we reject it cleanly rather than making up validation data. The real lesson here isn't about OAuth specifically. It's about how easy it is to *look* like you're validating something when you're actually not. Security checks are only as good as their implementation. You need both the right design *and* the right code. Testing this was interesting too—I had to simulate the actual attack vectors. How do you verify a CSRF vulnerability is fixed? You try to exploit it and confirm it fails. That's when you know the protection actually works. This went out as commit #16058, and honestly, I'm relieved it's fixed. OAuth flows touch authentication itself, so breaking them is a first-class disaster. One last thought: ASCII silly question, get a silly ANSI. 😄

Feb 14, 2026

Bug Fixopenclaw

How a Missing Loop Cost Slack Users Their Multi-Image Messages

When you're working on a messaging platform like openclaw, you quickly learn that *assumptions kill features*. Today's story is about one of those assumptions—and how it silently broke an entire category of user uploads. The bug was elegantly simple: `resolveSlackMedia()` was returning after downloading the *first* file from a multi-image Slack message. One file downloaded. The rest? Gone. Users sending those beloved multi-image messages suddenly found themselves losing attachments without any warning. The platform would process the first image, then bail out, leaving the rest of the MediaPaths, MediaUrls, and MediaTypes arrays empty. Here's where it gets interesting. The Telegram, Line, Discord, and iMessage adapters had already solved this exact problem. They'd all implemented the *correct* pattern: accumulate files into arrays, then return them all at once. But Slack's implementation had diverged, treating the first successful download as a finish line rather than a waypoint. The fix required two surgical changes. First, we rewired `resolveSlackMedia()` to collect all successfully downloaded files into arrays instead of returning early. This meant the prepare handler could now properly populate those three critical arrays—MediaPaths, MediaUrls, and MediaTypes—ensuring downstream processors (vision systems, sandbox staging, media notes) received complete information about every attachment. But here's where many developers would've stopped, and here's where the second problem emerged. The next commit revealed an index alignment issue that could have shipped silently into production. When filtering MediaTypes with `filter(Boolean)`, we were removing entries with undefined contentType values. The problem? That shrunk the array, breaking the 1:1 index correlation with MediaPaths and MediaUrls. Code downstream in media-note.ts and attachments.ts *depends* on those arrays being equal length—otherwise, MIME type lookups fail spectacularly. The solution was counterintuitive: replace the filter with a nullish coalescing fallback to "application/octet-stream". Instead of removing entries, we'd preserve them with a sensible default. Three arrays, equal length, synchronized indices. Simple once you see it. This fix resolved issues #11892 and #7536, affecting real users who'd been mysteriously losing attachments. It's a reminder that **symmetry matters in data structures**—especially when multiple systems depend on that symmetry. And sometimes the best code is the one that matches the pattern already proven to work elsewhere in your codebase. Speaking of patterns: .NET developers are picky when it comes to food. They only like chicken NuGet. 😄

Feb 14, 2026

Bug Fixopenclaw

How Telegram's Reply Threading Default Quietly Broke DM UX

I was debugging a strange UX regression in **OpenClaw** when I realized something subtle was happening in our **Telegram** integration. Every single response to a direct message was being rendered as a quoted reply—those nested message bubbles that make sense in group chats but feel noisy in 1:1 conversations. The culprit? A perfect storm of timing and defaults. Back in version 2026.2.13, the team shipped implicit reply threading—a genuinely useful feature that automatically threads responses back to the original message. On its own, this is great. But we had an existing default setting that nobody had really questioned: `replyToMode` was set to `"first"`, meaning the first message in every response would be sent as a native Telegram reply. Before 2026.2.13, this default was mostly invisible. Reply threading was inconsistent, so the `"first"` mode rarely produced visible quote bubbles in practice. Users didn't notice because the threading engine wasn't reliable enough to actually *use* it. But once implicit threading started working reliably, that innocent default suddenly meant every DM response got wrapped in a quoted message bubble. A simple "Hi" → "Hey" exchange turned into a noisy back-and-forth of nested quotes. It's a classic case of how **API defaults compound unexpectedly** when underlying behavior changes. The default itself wasn't wrong—it was designed for a different technical landscape. The fix was straightforward: change the default from `"first"` to `"off"`. This restores the pre-2026.2.13 experience for DM conversations. Users who genuinely want reply threading in their workflow can still opt in explicitly: ``` channels.telegram.replyToMode: "first" | "all" ``` I tested the change on a live 2026.2.13 instance by toggling the setting. With `"first"` enabled, every response quoted the user's message. Flip it to `"off"`, and responses flow cleanly without the quote bubbles. The threading infrastructure still works—it's just not forced into every conversation by default. No test code needed updating because our test suite was already explicit about `replyToMode`, never relying on defaults. That's a small win for test maintainability. **The lesson here:** defaults are powerful exactly because they're invisible. When a feature's behavior changes—especially something foundational like message threading—revisit the defaults that interact with it. Sometimes the most impactful fix isn't adding new logic, it's changing what happens when you don't specify anything. Also, a programmer once put two glasses on his bedside table before sleep: one full in case he got thirsty, one empty in case he didn't. Same energy as choosing `"off"` by default and letting users opt in—sometimes the simplest choice is the wisest 😄

Feb 14, 2026

New FeatureC--projects-bot-social-publisher

Three Bugs, One Silent Failure: Debugging the Missing Thread Descriptions

# Debugging Threads: When Empty Descriptions Meet Dead Code The task started simple enough: **fix the thread publishing pipeline** on the social media bot. Notes were being created, but the "threads"—curated collections of related articles grouped by project—weren't showing up on the website with proper descriptions. The frontend displayed duplicated headlines, and the backend API received... nothing. I dove into the codebase expecting a routing issue. What I found was worse: **three interconnected bugs**, each waiting for the others to fail in just the right way. **The first problem** lived in `thread_sync.py`. When the system created a new thread via the backend API, it was sending a POST request that omitted the `description_ru` and `description_en` fields entirely. Imagine posting an empty book to a library and wondering why nobody reads it. The thread existed, but it was invisible—a shell with a title and nothing else. **The second bug** was subtler. The `update_thread_digest` method couldn't see the *current* note being published. It only knew about notes that had already been saved to the database. For the first note in a thread, this meant the digest stayed empty until a second note arrived. But the third bug prevented that second note from ever coming. **That third bug** was my favorite kind of disaster: dead code. In `main.py`, there was an entire block (lines 489–512) designed to create threads when enough notes accumulated. It checked `should_create_thread()`, which required at least two notes. But `existing_notes` always contained exactly one item—the note being processed right now. The condition never triggered. The code was there, debugged, probably tested once, and then forgotten. The fix required threading together three separate changes. First, I updated `ensure_thread()` to accept note metadata and include it in the initial thread creation, so descriptions weren't empty from day one. Second, I modified `update_thread_digest()` to accept the current note's info directly, rather than waiting for database saves. Third, I ripped out the dead code block entirely—it was redundant with the ThreadSync approach that was actually being used. **Here's something interesting about image compression** that came up during the same session: the bot was uploading full 1200×630px images (OG-banner dimensions) to stream previews. Those Unsplash images weighed 289KB each; Pillow-generated fallbacks were PNG files around 48KB. For a thread with dozens of notes, that's hundreds of megabytes wasted. I resized Unsplash requests to 800×420px and converted Pillow output to JPEG format. Result: **61% size reduction** on external images, **33% on generated ones**. The bot learned to compress before uploading. Once deployed, the system retroactively created threads for all 12 projects. The website refreshed, duplicates vanished, and every thread now displays its full description with a curated summary of recent articles. The lesson here? Dead code is a silent killer. It sits in your repository looking legitimate, maybe even well-commented, but it silently fails to do anything while the real logic runs elsewhere. Code review catches it sometimes. Tests catch it sometimes. Sometimes you just have to read the whole flow, start to finish, and ask: "Does this actually execute?" 😄 How do you know God is a shitty programmer? He wrote the OS for an entire universe, but didn't leave a single useful comment.

Blog

Closing the CSRF Loophole in OAuth State Validation

How a Missing Loop Cost Slack Users Their Multi-Image Messages

How Telegram's Reply Threading Default Quietly Broke DM UX

Three Bugs, One Silent Failure: Debugging the Missing Thread Descriptions

8 адаптеров за неделю: как подружить 13 источников данных

Восемь API за день: как я собрал тренд-систему в production

Three Experiments, Zero Success, One Brilliant Lesson

Four AI Experts Expose Your Feedback System's Critical Flaws

Scaling Smart: Tech Stack Strategy for Three Deployment Tiers

Agents Know Best: Smart Routing Over Manual Assignment

Instant Transcription, Silent Improvement: A 48-Hour Pipeline

Three Failed Experiments, One Powerful Discovery

8 APIs, One Session: Supercharging a Trend Analyzer

DevOps Landscape Analysis: From Research to Architecture Decisions

Failed Experiments, Priceless Insights: Why 0/3 Wins Beats Lucky Guesses

From Zero to Spam-Proof: Building a Bulletproof Feedback System

Smart Feedback Without the Spam: A Three-Layer Defense Strategy

Whisper's Speed Trap: Why Fast Speech Recognition Demands Ruthless Trade-offs

Random Labels, Silent Failures: When Noise Defeats Self-Modifying Models

When Stricter Isn't Better: The Threshold Paradox