// catalog · ls -la /lab/tools

Showing entries 115 to 120 of 139

  1. 115 Vision / Audio alt-text:// Generates accessible alt text from any image. Returns three lengths: brief (under 10 words), standard (one sentence), detailed (2-3 sentences). Also flags decorative-only images that should get empty alt text, and tells you when an image is doing real semantic work that needs a longer description.
  2. 116 Vision / Audio image-prompt:// Generates a text-to-image prompt that would recreate (or remix) the uploaded image. Names subject, style, composition, lighting, palette. Useful for finding a reference image you can iterate from in SDXL / Flux / Midjourney.
  3. 117 Vision / Audio image-caption:// Generates an image caption in your chosen voice: formal (museum-label), witty (one-line gag), poetic (3 lines), deadpan (literal-funny), or hype (social-post energy). Same image, different mood.
  4. 118 Vision / Audio image-classify:// Classifies any image: returns top-5 class labels with confidence percentages, the dominant category (object / scene / portrait / chart / screenshot / illustration), and 3 "what this image is NOT" anti-labels that other classifiers might get wrong.
  5. 119 Vision / Audio image-mood:// For designers: takes an image and translates it into a mood / palette / style brief you can hand to another designer or feed into a prompt. Names the colour palette in hex, the emotional read, the design references it echoes, the era / context.
  6. 120 Vision / Audio image-ocr:// Optical character recognition via a vision model. Extracts text from any image: handwriting, signage, document scans, screenshots, photos of receipts. Preserves layout where useful (line breaks, columns). Flags low-confidence words you should double-check.
