Hey all, wanted to share a first draft of what I’ve been working on!
Aeon takes written documents, such as web pages, Google docs, even audio transcriptions, and turns them into high-quality ‘explainer’ style videos. The videos have a professional look and feel, with graphics and structure rivaling that of what human producers do.
In this example, we took the following, admittedly very boring website:
Check out the walkthrough here:
The user just gives very few instructions: Make a commercial, focus on the partners at the firm, etc.
Then a whole bunch of AI tools take over: GPT3, BLIP, NLP tools, Elevenlabs TTS, facial detection, etc. There are still some glitchy bits, but I’ll iron those out.
What was super fun (and challenging) was that GPT3 actually “directed” the visuals. I have approx a dozen scenes and styles that I describe to GPT3, which then decides what type of visual and motion graphics would work for each scene.
For the Shotstack folks: thank you for making a great product! I definitely have suggestions as to what would make my life easier. Would love any and all feedback you all have too.