GLM 5.2 beats Claude in our benchmarks

https://semgrep.dev/assets/semgrep-benchmarking-summary-with-padding.png
GLM 5.2, an open-weight model, beat Claude Code at IDOR detection with 39% F1, costing $0.17 per vulnerability found. The harness still matters more than the model, but open-weight models have crossed a threshold worth watching for security tasks.

HackerRank open sourced its ATS. My resume scored 90/100. Oh wait 74. No – 88

https://substackcdn.com/image/fetch/$s_!7pmg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bfee137-15fc-406e-b2db-91db9cdbb9f7_1980x900.png
An AI resume screening tool is flawed due to non-deterministic outputs from its LLM, leading to inconsistent scores. This tool fails to differentiate between qualified and unqualified candidates, essentially filtering based on luck rather than quality.

Age verification is just a precursor to automated attribution of speech

US states and countries are introducing age verification regulations under the guise of protecting children, but it's actually a precursor to attributing digital identities to physical ones, making it easier for law enforcement to identify and harass individuals. This could lead to automated identity attribution and increased surveillance, allowing governments to target inconvenient people.

Herdr: Agent multiplexer that lives in your terminal

https://raw.githubusercontent.com/ogulcancelik/herdr/master/assets/logo.png
Herdr is a terminal multiplexer with workspaces, panes, and tiling, supporting SSH and persistence. It uses explicit keybindings and shows agent states in the sidebar.

Dissecting Apple's Sparse Image Format (ASIF)

https://schamper.dev/dissecting-apples-sparse-image-format-asif/figure1.png
The user reverse engineered the ASIF file format used in macOS 26 Tahoe, discovering its structure and how to read from an arbitrary offset inside the virtual disk. The format uses a combination of tables, entries, and bitmaps to store and manage chunks of data, and the user provided a detailed explanation of how to calculate the correct chunk for a given offset.

Lore – give your coding agent the decisions your team made

https://repository-images.githubusercontent.com/1255866572/8702002c-6eb6-4689-b725-868c5177c008
Lore is a deterministic system of record that grounds agents against team decisions, citing current decisions and declining superseded ones. It uses Requirements as Code (RAC) to classify and validate artifacts, enforcing trust boundaries and preventing agent mutations.

Historical memory prices 1960-2026

The interactive dataset tracks historic and current memory and storage prices, including DRAM, NAND flash, and HBM, with modeled estimates from Epoch AI for AI-accelerator costs. The data is downloadable and includes quarterly updates for HBM and monthly updates for DRAM and NAND prices from 1957 to the present.

5k menus from the New York Public Library’s Buttolph Collection (1880-1920)

https://pudding.cool/2026/06/menu-story/assets/social-facebook.jpg
What do America’s earliest restaurant menus teach us about America?

I used Claude Code to get a second opinion on my MRI

https://antoine.fi/rails/active_storage/representations/proxy/eyJfcmFpbHMiOnsiZGF0YSI6NTU5LCJwdXIiOiJibG9iX2lkIn19--14f45f75a8f8216447265425e1b3cebe1d211a37/eyJfcmFpbHMiOnsiZGF0YSI6eyJmb3JtYXQiOiJwbmciLCJyZXNpemVfdG9fbGltaXQiOlsyMDQ4LDE1MzZdfSwicHVyIjoidmFyaWF0aW9uIn19--c0bbe91c3d5655e568ecfbbb413acaec2c5c367e/image.png
The author used Opus 4.8 to analyze an MRI and got a second opinion on their diagnosis, which disagreed with the original doctor's report. The AI analysis found no partial-thickness tear, contradicting the doctor's diagnosis and making the author question the treatment plan.

Knowledge Distillation of Black-Box Large Language Models (2024)

https://arxiv.org/static/browse/0.3.4/images/arxiv-logo-fb.png
Researchers introduced Proxy-KD, a method to transfer knowledge from black-box LLMs to smaller models efficiently. Proxy-KD surpasses traditional white-box KD techniques and enhances performance.

Deciphering Basmala

https://pic.blog.plover.com/TOP.jpg
An article discusses Arabic typesetting complications, including the importance of ligatures in Arabic script. The inclusion of a special Unicode codepoint solved the problem of rendering the basmala phrase correctly.

Show HN: Zanagrams

https://zanagrams.com/og-image.png
Zanagrams is a free daily word puzzle. Drag across the letters to find the hidden words and watch the grid shrink as you solve it. A new Zanagrams puzzle every day.

TOP500 at ISC’26: We have a New Number 1 Supercomputer

https://substackcdn.com/image/fetch/$s_!rXdh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11672287-1e72-4e2b-9220-bd12de3d8ab4_1200x900.png
The LineShine Supercomputer in China has taken the top spot on the 67th TOP500 list with 2.198 Exaflops of sustained FP64 performance. This is the first Chinese submission to the TOP500 in 9 years and marks a significant milestone for China's HPC capabilities.

Tokenmaxxing is dead, long live tokenmaxxing

https://substackcdn.com/image/fetch/$s_!rp_Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca712f-fd31-4c79-94b3-5a0ebe81bdb3_800x786.jpeg
The concept of tokenmaxxing, where companies spend large amounts of money on AI tokens without expecting a return, is not dead but rather evolving as companies realize the benefits of compounding correctness, where more tokens spent lead to better results. The shift towards open model platforms and generalist agents will lead to another rise in tokenmaxxing behavior, ultimately resulting in ...

Model Training as Code

https://aleph-alpha.com/_astro/00-cover-model-training-as-code.oaKwGkcy_Lw8uU.webp
Aleph Alpha built Savanna, a model factory that automates the entire model training pipeline in code, making it easier to collaborate and scale. By implementing Model Training as Code (MTaC), Savanna reduces errors, costs, and organisational challenges, enabling teams to work more efficiently and paving the way for auto-research.

The Boeing 747 begins its final descent

https://cdn.theatlantic.com/thumbor/HIoqct7cC0eypfzBtLSkiNp1U_A=/0x79:4932x2853/1440x810/media/img/2026/06/11/0726_WEL_Bogost_747_16x9/original.png
The author visits Pinal Airpark in Arizona to see retired Boeing 747s, once the principal host of important journeys, now a symbol of American decline. The 747 was a technological innovation that embodied American might, invention, and progress, but its accidental longevity defined an era of decline.

Professor denounces mass AI fraud on an exam at Brown

https://imagenes.elpais.com/resizer/v2/J6ITLVPHQNHI3BHNR3FPIX5URE.jpg?auth=29cc39cd3e10fbe5a5266dd40100f1d61e4fe145ccd2e4ab13b6a6bd0b67f1dd&width=414
Professor Roberto Serrano at Brown University detected massive cheating on a midterm exam using AI, with at least 50 students involved. He believes the university's response was inadequate and that AI is altering century-old traditions at elite universities, requiring a broader debate on academic integrity.

The KIDS Act would require age checks to get online

https://www.eff.org/files/banner_library/ageverificationbanner-3.png
The KIDS Act package includes bills that require age verification, government-directed moderation, and new rules for private communications. This could lead to restrictive age-checking practices, reduced online privacy, and limited free expression.

The Baffling World of Masayoshi Son's Presentations (2020)

Please make sure your browser supports JavaScript and cookies and that you are not blocking them from loading. For more information you can review our and .

The Forgotten Castles of the Garamantes

The Garamantes were an ancient Berber civilisation that built a sophisticated urban state in the Libyan Sahara, engineering water from deep underground to sustain their cities. Their legacy endures in the language and culture of the Tuareg, but their sites remain poorly protected and face new pressures from development and erosion.

Working around dragons with the Lemote Yeeloong laptop and OpenBSD

https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgwr0Vsnj1UuAJWn229_mBMPRg6E4JpX_jhbOUQTo00gTSHpVXRCDmY3SzMr9Qs46tAMdlzSXJFn5C8PLbPkKtn4_GtyIB-ytyD_h9b9u0-r0S_fL4IVoMLVt7RBO9OkQZ3ahw4p1ybokI3ucnJ3avc2dkFUGatYUTsPfvFEY2-cazKJSjes73VfqThHZ8/s320/stallman-lemote-2.JPG
The Loongson processor is a Chinese-developed CPU that emerged in 2001, initially based on MIPS architecture, and was later developed into a 64-bit processor. The Loongson processor was used in the Lemote Yeeloong laptop, a low-cost netbook that was released in 2008.

Librepods: AirPods liberated

https://raw.githubusercontent.com/librepods-org/librepods/main/imgs/banner.png
LibrePods is a project that allows non-Apple devices to use AirPods features. It implements the proprietary protocol used by Apple devices to exchange data with AirPods.

Daisugi, the Japanese technique of growing trees out of other trees (2020)

https://cdn8.openculture.com/2020/10/22225805/Daisugi.png
Daisugi, a Japanese technique, involves growing multiple trees from a single tree, creating a giant bonsai-like structure, but it's actually a form of coppicing, a common woodland management technique practiced for centuries. This method produces straight, usable timber without harming the original tree, and it's being re-emphasized as a sustainable way to manage woodlands.

A way to exclude sensitive files issue still open for OpenAI Codex

https://opengraph.githubassets.com/eb6eaf7d4028ea024171759a9f016eafa52d89840c3d40660be2d1d74b98731a/openai/codex/issues/2847
What feature would you like to see? A mechanism to explicitly mark files/paths that the agent must not read or send to the model, at both repository and global levels (e.g., a repo-local .codexignore plus a global ignore file). Example: ...

Examining circuit boards from the Space Shuttle's I/O Processor

https://static.righto.com/images/shuttle-ioprocessor/gpu-and-iop-w600.jpg
The Space Shuttle's I/O Processor had 24 network connections and 25 virtual processors, implementing two different instruction sets. It used microcode to run the virtual processors on one physical processor.

A glitch in February of the year 0

A team encountered a bug in parsing timestamps in the year 0 due to a PHP DateTimeImmutable issue. The problem was caused by a timelib library bug, which has been reported and a pull request has been opened to fix it.

More evidence is consistent with possible ancient life on Mars (2025)

https://i.cbc.ca/ais/1.7649703,1759672248000/full/max/0/default.jpg?im=Crop%2Crect%3D%28109%2C140%2C2415%2C1358%29%3B
Scientists have found potential evidence of life on Mars, but no conclusive proof, with the latest discovery being two chemicals formed by microbial activity or chemical reactions. The possibility of life on Mars has been tantalizing humans for over a century, with many previous claims debunked, but new research suggests life could exist underground.

Idler Magazine

The Idler Academy offers online classes, retreats, and a magazine focusing on philosophy, history, and personal growth through education. Their mission is to promote freedom and fulfillment through learning and laughter.

The curious case of the disappearing Polish S (2015)

https://aresluna.org/images/the-curious-case-of-the-disappearing-polish-s/ogimage.png
A Polish user reported a bug on Medium where Ś wouldn't appear when typed. The issue was caused by Medium blocking CtrlS to prevent browser save dialogs, but Polish users use AltS to type Ś.

Show HN: Bash4LLM+ – A lightweight, dependency-free Bash wrapper for LLM APIs

https://raw.githubusercontent.com/kamaludu/bash4llm/main/docs/img/bash4llm320.png
Bash4LLM⁺ è un wrapper CLI sicuro per l'API Chat Completions di OpenAI, scritto in Bash e completamente auditabile. È un singolo script auto-contenuto che può essere scaricato, eseguito e utilizzato subito.