Dream Manifest (Personal Project)

This API extracts speech from video, transcribes it to insight text, and generates images from that text using Google’s Gemini AI — a complete pipeline from video to visuals. Check here for details.




Enjoy Reading This Post?

Here are some more posts you might like to read next:

  • Multi-AI Agent for Discord (Personal Project)
  • Llama Paper Summary (Personal Project)
  • MSTA3D - Multi-scale Twin-attention for 3D Instance Segmentation
  • Mobile Robot (Personal Project)
  • 3DoF Arm Robot (Personal Project)