// catalog · ls -la /lab/tools


Entries 121 to 126 of 139

  121 · Vision / Audio · image-receipt:// · Extracts structured data from receipt and invoice images. Returns JSON with vendor, date, line items (description and amount), subtotal, tax, and total. Flags items with low confidence. Handles paper receipts, restaurant bills, and PDF invoices.
  122 · Vision / Audio · image-compare:// · Compares two images and reports what is the same and what is different, with a similarity score (0-100). Names structural, colour, and content differences. For QA of design diffs, before-and-after comparisons, and "is this the same image?" checks.
  123 · Vision / Audio · image-summarise:// · Summarises a set of images in prose: shared themes, common elements, outliers. For mood-board reviews, bulk processing of customer photos, and event photo selection. Returns the summary plus a "lead image" recommendation.
  124 · Vision / Audio · image-safety:// · Runs an image through content-safety classification. Returns flags for NSFW, violence, hate symbolism, medical / graphic content, minors in inappropriate context, weapons, and drug paraphernalia. Each flag carries a confidence level and the specific signal that triggered it.
  125 · Vision / Audio · logo-critique:// · Critiques a logo design on legibility, balance, distinctiveness, and scalability. Returns 3 specific fix directions (not just "make it better"). Friendly but honest. Designed to be the second opinion a designer's spouse wishes they could give.
  126 · Vision / Audio · screenshot-explain:// · For accessibility audits and bug reports: takes a UI screenshot and narrates what is on screen in screen-reader order. Lists actionable controls (buttons, links, inputs) separately. Flags accessibility concerns (low contrast, ambiguous icons).
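
As an illustration of how the JSON described for entry 121 (image-receipt://) might be consumed, here is a minimal Python sketch. The catalog names the fields (vendor, date, line items with description and amount, subtotal, tax, total, low-confidence flags) but not their exact keys or types, so every key below (`line_items`, `low_confidence`, and so on) is an assumption, as are the `parse_receipt` and `review_notes` helpers. It only checks that the extracted amounts are internally consistent and lists the flagged items.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass
class LineItem:
    description: str
    amount: Decimal
    low_confidence: bool = False  # hypothetical key; the catalog only says items are "flagged"


@dataclass
class Receipt:
    vendor: str
    date: str
    line_items: list
    subtotal: Decimal
    tax: Decimal
    total: Decimal


def parse_receipt(payload: dict) -> Receipt:
    """Build a Receipt from a dict shaped like the assumed JSON output."""
    items = [
        LineItem(
            description=item["description"],
            amount=Decimal(str(item["amount"])),
            low_confidence=bool(item.get("low_confidence", False)),
        )
        for item in payload["line_items"]
    ]
    return Receipt(
        vendor=payload["vendor"],
        date=payload["date"],
        line_items=items,
        subtotal=Decimal(str(payload["subtotal"])),
        tax=Decimal(str(payload["tax"])),
        total=Decimal(str(payload["total"])),
    )


def review_notes(receipt: Receipt) -> list:
    """Return human-readable notes for anything worth a manual check."""
    notes = []
    items_sum = sum((item.amount for item in receipt.line_items), Decimal("0"))
    if items_sum != receipt.subtotal:
        notes.append("line items do not sum to subtotal")
    if receipt.subtotal + receipt.tax != receipt.total:
        notes.append("subtotal + tax does not equal total")
    for item in receipt.line_items:
        if item.low_confidence:
            notes.append(f"low confidence: {item.description}")
    return notes


# Made-up sample payload, for illustration only.
sample = {
    "vendor": "Example Cafe",
    "date": "2024-01-15",
    "line_items": [
        {"description": "Coffee", "amount": "4.50"},
        {"description": "Bagel", "amount": "3.25", "low_confidence": True},
    ],
    "subtotal": "7.75",
    "tax": "0.62",
    "total": "8.37",
}

notes = review_notes(parse_receipt(sample))
```

Amounts are parsed into `Decimal` rather than `float` so that the sum checks compare exact currency values.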
