AI Safety
32 posts
Healthcare AI Compliance in India: ABHA, SAHI, BODH, and FHIR
A practical guide to governing health-data AI workflows with ABHA-aware redaction, SAHI checks, BODH evidence, and FHIR-ready streams.
1 May 2026 · 5 min read
Aadhaar Detection: Verhoeff Checksums and Why Regex Isn't Enough
Regex finds shapes. Checksums reduce false positives and make PII controls credible.
27 Apr 2026 · 5 min read
Aadhaar Detection Needs More Than Regex
AI safety operating note 1: a practical note for AI platform engineers on plain regex catches invoice numbers and misses obfuscated Aadhaar text.
2 Apr 2026 · 5 min read
Output Scanning for PII Leakage
AI safety operating note 2: a practical note for LLM application teams on models can reintroduce personal data even after input redaction.
1 Apr 2026 · 5 min read
Prompt Injection Controls for Compliance Agents
AI safety operating note 3: a practical note for security teams on malicious pages can ask an agent to ignore privacy policy.
31 Mar 2026 · 5 min read
Trust Scores for AI Agents
AI safety operating note 4: a practical note for AI governance leads on binary pass/fail hides slow agent degradation.
30 Mar 2026 · 5 min read
Shadow AI Detection on Public Websites
AI safety operating note 5: a practical note for risk teams on marketing scripts quietly call model APIs outside governance.
29 Mar 2026 · 5 min read
Circuit Breakers for Unsafe AI Behavior
AI safety operating note 6: a practical note for SRE teams on unsafe agents continue serving traffic after repeated violations.
28 Mar 2026 · 5 min read
Safe Prompt Templates for Regulated Teams
AI safety operating note 7: a practical note for developer-experience teams on free-form prompts drift away from approved policy.
27 Mar 2026 · 5 min read
Human Review Queues for High-Risk AI Calls
AI safety operating note 8: a practical note for operations managers on some requests should pause instead of being auto-answered.
26 Mar 2026 · 5 min read
AI Safety Regression Tests for Indian PII
AI safety operating note 9: a practical note for QA leaders on a scanner update can break Aadhaar or PAN detection silently.
25 Mar 2026 · 5 min read
False Positives in PII Redaction
AI safety operating note 10: a practical note for product teams on over-redaction makes AI answers useless.
24 Mar 2026 · 5 min read
Multilayer PII Defense for LLM Gateways
AI safety operating note 11: a practical note for security architects on single-detector systems fail on formatting tricks.
23 Mar 2026 · 5 min read
Safe Defaults for New AI Agents
AI safety operating note 12: a practical note for platform owners on new agents launch without controls because setup is optional.
22 Mar 2026 · 5 min read
Streaming Response Safety
AI safety operating note 13: a practical note for real-time AI teams on unsafe tokens can reach users before a full scan completes.
21 Mar 2026 · 5 min read
Red Teaming AI Compliance Workflows
AI safety operating note 14: a practical note for security reviewers on happy-path demos miss real attacker behavior.
20 Mar 2026 · 5 min read
Safe Retrieval for Private Documents
AI safety operating note 15: a practical note for RAG platform teams on retrieval can leak documents across tenants.
19 Mar 2026 · 5 min read
Model Failover Without Policy Drift
AI safety operating note 16: a practical note for reliability engineers on fallback providers may not share the same privacy settings.
18 Mar 2026 · 5 min read
AI Safety Dashboards for Non-Engineers
AI safety operating note 17: a practical note for DPOs and founders on technical logs do not create operational understanding.
17 Mar 2026 · 5 min read
Toxicity Checks Are Not Compliance Checks
AI safety operating note 18: a practical note for AI teams on generic moderation misses India-specific privacy risk.
16 Mar 2026 · 5 min read
Agent Tool Permissions as a Safety Boundary
AI safety operating note 19: a practical note for engineering leads on an agent with broad tools can expose data by action, not text.
15 Mar 2026 · 5 min read
AI Safety for Hindi and Hinglish Inputs
AI safety operating note 20: a practical note for Indian product teams on mixed-language prompts bypass English-only controls.
14 Mar 2026 · 5 min read
Measuring Redaction Quality
AI safety operating note 21: a practical note for governance teams on a redaction count does not prove quality.
13 Mar 2026 · 5 min read
Sensitive Output Replacement Patterns
AI safety operating note 22: a practical note for frontend teams on blocked answers need useful user-facing replacements.
12 Mar 2026 · 5 min read
Model Cost Controls as Safety Controls
AI safety operating note 23: a practical note for finance and platform teams on cost spikes can signal abuse or runaway agents.
11 Mar 2026 · 5 min read
Safety Policies for AI Copilots
AI safety operating note 24: a practical note for enterprise product teams on copilots see broad data but have vague responsibilities.
10 Mar 2026 · 5 min read
Audit-Ready AI Incident Timelines
AI safety operating note 25: a practical note for incident responders on AI incidents become unclear when logs live across tools.
9 Mar 2026 · 5 min read
Guarding Against Prompt Data Exfiltration
AI safety operating note 26: a practical note for security engineers on attackers ask models to reveal hidden context.
8 Mar 2026 · 5 min read
Safe Evaluation Datasets for AI Governance
AI safety operating note 27: a practical note for ML evaluation teams on evaluation data accidentally includes live personal data.
7 Mar 2026 · 5 min read
Provider-Agnostic AI Safety Controls
AI safety operating note 28: a practical note for platform teams on switching models can bypass library-specific guardrails.
6 Mar 2026 · 5 min read
AI Safety Sign-Off Before Production
AI safety operating note 29: a practical note for release managers on AI features ship without a crisp owner.
5 Mar 2026 · 5 min read
Latency Budgets for Safety Checks
AI safety operating note 30: a practical note for SRE and product teams on slow controls get bypassed during launches.
4 Mar 2026 · 5 min read