When Veo3 first came, I was absolutely blown away. The quality of the output that people were able to create was nothing short of stunning. The addition of audio to the video output took the clips to the next level. Anyone can create world-class video content; the only limit is your imagination.
If you’re like me, the first few times you used Veo3, you put a prompt in that might have been something like these (these are the actual prompts I used):
“Televised footage of a st bernard doing an acrobatic dive into a swimming pool at the olympics, from a 10m high diving board, he bounced on it twice and then flips and spins into the water.”
“Create a scene where a cartoon panda is surrounded by FBI agents. The panda starts doing king fu moves kicking and hitting them and they go flying through the area like in a John Wick movie.”
A Saint Bernard in a forest. It's raining outside. He's cooking two burgers over a campfire. There is cheese on the burgers. He says, "this was my best friend cow but i got hungry."
And then moved to more complex ones like this:
Create a hyper-realistic shot of Sasquatch, an eight foot tall broad-shouldered creature covered in thick chestnut brown fur, deep amber eyes gleaming beneath a low brow, a broad flat nose, and a mischievous crooked grin, wearing oversized dark green lederhosen with cream suspenders stretched across his massive chest, a slightly wrinkled white festival shirt rolled at the sleeves, and a small alpine hat tipped with a blue feather. He stands upright inside a brightly lit Oktoberfest beer tent, holding a foaming one-liter stein in his left hand while extending a compact handheld vlogging camera at whole arm’s length with his right, the lens pointed directly back at him. Sasquatch shifts to fit beneath the canvas ceiling, his furry shoulders brushing decorative garlands as he declares in a deep, jovial voice, “I’m going to get drunk with Steve, Scott, and Adam tonight.” Rows of long wooden tables crowd the background, packed with revelers in colorful dirndls and lederhosen who sway together, mugs raised high, cheering and chanting “Ein Prosit der Gemütlichkeit.” Warm amber festoon lights cast a golden glow while blue and white banners ripple overhead, droplets of spilled beer occasionally speckling the lens. Environmental audio captures thunderous crowd cheering, rhythmic stomping on benches, hearty chanting, clinking glass steins, and distant laughter, with no music included. The handheld footage rocks with subtle natural shakes as Sasquatch adjusts his grip, the framing slightly off-center before he takes a slow forward step that adds a gentle push in movement, the camera tilting up to keep his towering head in view. The entire footage must be seen exactly as recorded through Sasquatch’s handheld selfie vlogging camera, with minor natural shakes, slight off-centering, and subtle repositioning as he navigates the bustling tent. No external cameras, no third-person views — only the imperfect handheld view from Sasquatch’s own perspective. No captions must be generated. No Cinematic Black Bars must be generated. Aspect ratio must be in 16:9. No watermark must be generated.”
Even with a more complex prompt like that, while the output was good, it wasn’t at the level of content I was seeing others create. I was trying to figure out how to make this better. I’m not a cinematography expert by any stretch of the imagination, but I figured I had to be missing something.
Turns out I was. That missing something was JSON. I started seeing people on X post amazing videos and sharing their output.
One of the creators I stumbled across was Emily (https://x.com/IamEmily2050). She is putting out amazing-looking content and is openly sharing her prompts. This is one from the following post: https://x.com/IamEmily2050/status/1948575759066616188
{
"model": "veo-3.0-fast",
"shot": {
"duration": 8,
"aspect_ratio": "16:9",
"composition": "crane-to-orbit establishing, 35 mm gate",
"camera_motion": "0-5 s: descend & orbit to 45°; 5-8 s: static hover",
"frame_rate": "24fps",
"film_grain": "Kodak 50D, subtle"
},
"scene": {
"setting": "open-air stadium pitch",
"lighting": "bright midday sun, hard shadows",
"atmosphere": "clear tropical"
},
"subjects": {
"count": 500,
"description": "Brazilian men and women in green-yellow-gold samba costumes with feathered headdresses",
"props": "oversized folding fans in radial carnival patterns"
},
"action": {
"0-2": "crowd sways left→right, fans held closed at waist",
"2-4": "samba hip roll, fans snap open creating giant yellow diamond shape",
"4-6": "formation pivots 90°, fans ripple outward in wave pattern",
"6-8": "dancers knee-dip, fans flip shut in unison, freeze pose"
},
"audio": {
"music": "energetic samba bateria (surdo, repinique, agogô, cuíca)",
"fx": "synchronized fan snap, stadium reverb"
},
"visual_details": {
"color_palette": "emerald, canary yellow, gold, white turf lines",
"textures": "sequins, feather plumes, silk fan fabric"
},
"cinematography": {
"lens": "24 mm rectilinear",
"filters": "polarizer for saturated sky & costumes",
"exposure": "1/96 s shutter, T8"
},
"visual_rules": {
"emphasis": [
"timed fan choreography",
"glittering feathers under sunlight"
],
"avoid": [
"crowd close-ups",
"text overlays",
"excess motion blur"
]
}
}
I thought, 'How on earth does she know how to create all of those structures?' How do you figure that out? I wouldn’t know all the decisions involved in lenses, filters, exposure, motion blur, textures, color palettes, etc. As I followed more and more people who were doing extraordinary things, I realized that AI was doing all of that for them.
I then started working with Gemini 2.5 Pro, experimenting with prompts to generate this type of JSON output. I spent a week working with Gemini to refine the output, and I put the result in a GEM. Now, when I have an idea, I simply drop it into that GEM and get fantastic JSON output that I can put straight into Veo3, yielding excellent results.
Let me show you the differences. I decided to start with this prompt: “Side view at tire level of a slate gray 2007 Porsche 911s racing at high speed along a mountain road at the golden hour. After 2 seconds the camera pulls back higher and moves directly behind and above the 911 as the car continues to accelerate to extremely high speeds while navigating the mountain road.”
I dropped it into Veo3 in my Gemini app. The first video that came out was kind of hilarious, the car instantly transforms from sideways to backwards to forward. Link here:
The second one running the same prompt came out better (you’ll find that when you do a lot of videos, you often have to run multiple times). It’s pretty close to my original vision. Link here:
However, I then decided to incorporate that exact description into my new Video JSON prompt generator. This is the output I got back from it. As you can see, it is WAY more detailed, super specific, and has incredible levels of insight around camera shots, sun flares, sound, aspect ratios, angles, etc. It’s a whole different level of control. The Veo3 engine is remarkably adept at processing this input.
{
"characters": [
{
"name": "2007 Porsche 911 Carrera S",
"age": 0,
"physicality": {
"height": "4'3\" / 130 cm",
"build": "compact, athletic coupe with wide rear haunches",
"posture_and_gait": "low, aggressive stance; moves with precision and speed"
},
"appearance": {
"skin_tone": "Slate Grey Metallic paint finish",
"hair": "N/A",
"eyes": "Bi-Xenon headlights with a clear, focused beam",
"distinguishing_marks": "Classic teardrop side windows, iconic rear engine decklid, and red brake calipers visible behind 19-inch 'Lobster Claw' wheels."
},
"persona": {
"demeanour": "N/A",
"voice": "A distinctive, high-revving flat-six engine roar",
"core_motivation": "To deliver a pure, high-performance driving experience.",
"wardrobe_default": "Factory stock appearance, immaculately clean."
},
"performance_defaults": {
"mouth_shape_intensity": 0.0,
"eye_contact_ratio": 0.0
}
}
],
"clips": [
{
"id": "S01_TireLevel_Pass",
"duration_sec": 2,
"aspect_ratio": "2.35:1",
"camera_setup": {
"shot_type": "Medium Shot",
"camera_angle": "Extreme Low Angle",
"lens": {
"focal_length_mm": 35,
"aperture_fstop": 2.8,
"lens_effects": "anamorphic lens flare"
},
"focus": {
"depth_of_field": "shallow depth of field",
"focus_subject": "The spinning front wheel and the Porsche crest on the center cap.",
"focus_pull": "None"
},
"camera_motion": {
"type": "dolly",
"description": "Camera tracks perfectly parallel to the car, locked on the front wheel as it blurs past. The motion is smooth and fast.",
"gear_emulation": "Russian Arm on a camera car"
}
},
"scene_description": {
"location": "A winding asphalt mountain road.",
"time_of_day": "golden hour",
"weather": "Clear and calm, the low sun creates long shadows.",
"environment_details": "The asphalt is dark and clean. A painted yellow line blurs into a solid streak. The background is a mix of rock face and motion-blurred guardrails.",
"set_dressing": "None"
},
"subject_performance": {
"character_name": "2007 Porsche 911 Carrera S",
"action_and_blocking": "The car moves from right to left across the screen at approximately 80 mph. The wheel is spinning rapidly, suspension compressing slightly over unseen road imperfections.",
"expression_and_emotion": "The car exudes pure, controlled speed.",
"gaze_direction": "N/A",
"wardrobe_in_scene": "Slate Grey Metallic paint, clean.",
"props": "N/A"
},
"art_direction": {
"lighting": {
"style": "Naturalistic",
"key_light": "The low, warm, hard light of the setting sun from camera left.",
"fill_light": "Soft, ambient bounce light from the sky and road surface.",
"rim_light": "The setting sun creates a bright golden glint along the car's roofline and fender.",
"motivated_sources": "The setting sun."
},
"color": {
"color_grade_style": "Teal & Orange blockbuster",
"color_palette": [
"#5A6468",
"#E87A3E",
"#222222",
"#F4D8A0"
]
},
"visual_style": {
"mood": "energetic",
"film_stock_emulation": "Kodak Vision3 250D",
"stylistic_effects": "Motion blur on the road surface and background. Strong anamorphic lens flares when the sun hits the lens."
}
},
"audio_design": {
"dialogue": null,
"sound_effects": {
"ambience": "High-speed wind roar.",
"spot_sfx": "The distinct, sharp roar of a Porsche flat-six engine at mid-to-high RPMs. The whir of the tires on asphalt."
},
"music_score": {
"style": "Tense and building synthwave",
"instrumentation": "Driving synth bassline, arpeggiated melodies, and a simple electronic drum beat.",
"mood": "anticipatory",
"tempo_bpm": 120
}
}
},
{
"id": "S02_Aerial_Chase",
"duration_sec": 5,
"aspect_ratio": "2.35:1",
"camera_setup": {
"shot_type": "Wide Shot",
"camera_angle": "High Angle",
"lens": {
"focal_length_mm": 24,
"aperture_fstop": 5.6
},
"focus": {
"depth_of_field": "deep focus",
"focus_subject": "The Porsche 911 and the road ahead.",
"focus_pull": "None"
},
"camera_motion": {
"type": "drone",
"description": "Camera starts low and close behind the car, then rapidly pulls up and back, revealing the vast mountain landscape as the car accelerates away down the winding road.",
"gear_emulation": "FPV drone"
}
},
"scene_description": {
"location": "A winding asphalt mountain road.",
"time_of_day": "golden hour",
"weather": "Clear and calm.",
"environment_details": "The road snakes through a vast mountain range. The setting sun casts a warm, orange glow across the peaks and valleys. Long, dramatic shadows are cast by the mountains.",
"set_dressing": "Pine trees and rocky outcroppings line the road."
},
"subject_performance": {
"character_name": "2007 Porsche 911 Carrera S",
"action_and_blocking": "The car accelerates hard, its engine note rising to a crescendo. It expertly navigates a series of S-curves, brake lights flashing briefly before each turn before powering out.",
"expression_and_emotion": "The car appears powerful and agile, dominating the road.",
"gaze_direction": "N/A",
"wardrobe_in_scene": "Slate Grey Metallic paint, catching the golden sunlight.",
"props": "N/A"
},
"art_direction": {
"lighting": {
"style": "Naturalistic",
"key_light": "The low, warm, hard light of the setting sun, now acting as a backlight.",
"fill_light": "Strong ambient light from the vast open sky.",
"rim_light": "The sun creates a brilliant golden silhouette around the car.",
"motivated_sources": "The setting sun."
},
"color": {
"color_grade_style": "Teal & Orange blockbuster",
"color_palette": [
"#5A6468",
"#E87A3E",
"#222222",
"#F4D8A0"
]
},
"visual_style": {
"mood": "epic",
"film_stock_emulation": "Kodak Vision3 250D",
"stylistic_effects": "Subtle lens flare. The vastness of the landscape is emphasized."
}
},
"audio_design": {
"dialogue": null,
"sound_effects": {
"ambience": "The sound of wind becomes more distant and is replaced by a faint echo.",
"spot_sfx": "The engine roar reaches its peak, echoing off the mountains. Faint tire squeal is heard during the sharp turns."
},
"music_score": {
"style": "Triumphant and soaring synthwave",
"instrumentation": "The main synth melody kicks in, layered with pads and a more powerful drum track.",
"mood": "triumphant and soaring",
"tempo_bpm": 120
}
}
}
]
}
Here’s the link to that version:
Here’s that same version upscaled to 1080p (same video, higher quality):
So, I did lots of experimentation and decided that now that I had a good JSON engine, I wondered if I could do something cool. I’ve always loved the epic Apple 1984 commercial, which aired only once at the Super Bowl. So I thought, 'I wonder if I can recreate that now?'
The results are the headline video on top of the page - you tell me! Link also here:
So, how would I go about doing this? Here’s the process I went through.
First, I asked Gemini to create the storyline for me with the following prompt: “I am going to recreate the iconic Apple 1984 commercial scene by scene but using futuristic robots. Write up a scene by scene description for me that I can use with Veo3. Do every scene independently and label them. Follow the original commercial sequence exactly and be as descriptive as possible. Add the number of seconds each scene is next to the scene number.”
What I got back was this - a great outline!
—--------
Scene 1 (Approx. 6 seconds)
Label: The Marching Drones
Description: A low-angle dolly shot moving alongside a seemingly endless column of identical, bipedal robots. They march in perfect, unnerving synchrony down a massive, brutalist concrete causeway under a perpetually overcast, polluted sky. These "Worker Drones" are made of a dull, unpainted, slightly weathered grey metal. Their design is utilitarian and featureless, with no face, only a single, dimly glowing blue optic in the center of their heads. The sound is a heavy, rhythmic, metallic clanking of their footsteps echoing through the oppressive architecture.
Scene 2 (Approx. 3 seconds)
Label: The Liberator's Sprint
Description: Cut to an explosive, dynamic shot of a completely different robot—the "Liberator"—sprinting directly towards the camera down a dark, narrow service corridor. This robot is sleek, athletic, and has a defined female form. Its chassis is made of a glossy, polished white polymer composite with vibrant orange accents on the joints and torso, reminiscent of a runner's outfit. In its right hand, it clutches a large, high-tech sledgehammer with a metallic head that subtly hums with contained energy. Its faceplate is a smooth, reflective visor, showing its determined focus. It is being pursued.
Scene 3 (Approx. 4 seconds)
Label: Entering the Assembly
Description: A wide shot from the back of a cavernous, dimly lit assembly hall. The columns of grey Worker Drones file in through a massive tunnel-like entrance. They move without expression, their single blue optics staring straight ahead. The hall is dark, industrial, and cold. They march down a central aisle towards a colossal screen at the far end, which is currently dark. They begin to take their seats in perfectly ordered rows.
Scene 4 (Approx. 3 seconds)
Label: The Enforcers
Description: Quick, jarring cut. A squad of heavily armored "Enforcer Bots" pursue the Liberator. They are much larger and bulkier than the Worker Drones, coated in matte black plating with glowing red linear visors. They crash through a metal grate with brute force, carrying long, sparking energy batons. They move with menacing speed and efficiency down the same pipe-filled corridor as the Liberator.
Scene 5 (Approx. 12 seconds)
Label: The Mainframe's Address
Description: An extreme close-up on the giant screen in the assembly hall. A massive, hyper-realistic, chrome-plated robotic face materializes. This is the "Mainframe." Its design is sophisticated but devoid of warmth, with cold, piercing light-blue eyes. Its mouth moves in perfect sync with a booming, dispassionate monologue about "information purification," "a garden of pure ideology," and "unifying our thoughts" to create a perfect, controlled consciousness. The camera slowly pans out to show the thousands of Worker Drones sitting rigidly, their heads tilted up, their single blue optics reflecting the Mainframe's face, completely captivated.
Scene 6 (Approx. 2 seconds)
Label: The Hero's Entrance
Description: The massive doors at the back of the hall burst open. The Liberator robot stands silhouetted against the bright, harsh light of the corridor behind her. She pauses for a split second, a beacon of white and orange against the hall's gloom. The heads of the first few rows of Worker Drones mechanically turn to track her.
Scene 7 (Approx. 5 seconds)
Label: The Charge
Description: A low-angle tracking shot follows the Liberator as she sprints with incredible speed down the central aisle. Her powerful, fluid movements are a stark contrast to the rigid stillness of the audience. The Enforcer Bots can be seen entering the hall far behind her, but they are too late. The booming voice of the Mainframe continues, unfazed, from the screen.
Scene 8 (Approx. 4 seconds)
Label: The Wind-Up
Description: Close-up on the Liberator as she approaches the front of the hall. She begins to spin, a graceful but powerful athletic motion. The camera focuses on her torso and arms as she builds momentum. The energy core of the high-tech sledgehammer begins to glow brighter, cycling from a low hum to a high-pitched whine. The scene shifts into slow motion.
Scene 9 (Approx. 3 seconds)
Label: The Release
Description: An extreme slow-motion shot. Her robotic arm extends with immense force, releasing the glowing sledgehammer. The camera follows the hammer as it soars through the air, tumbling end over end. For a moment, it perfectly eclipses the cold, robotic face of the Mainframe on the screen. The reflections in the hammer's polished surface show the rows of watching Drones.
Scene 10 (Approx. 2 seconds)
Label: The Impact
Description: The sledgehammer smashes directly into the center of the screen, into the Mainframe's face. There is no traditional glass shatter. Instead, a cataclysmic explosion of pure white light and digital energy erupts from the point of impact. It is a silent, deafening blast of data and light, not fire and smoke.
Scene 11 (Approx. 7 seconds)
Label: The Awakening
Description: A powerful shockwave of light and wind blasts through the audience. The Worker Drones are thrown back in their seats. Their blank faceplates are illuminated by the warm, brilliant white light. Their single blue optics flicker rapidly, changing color to a brighter, more aware white. They look down at their own hands, then at each other, as if seeing for the first time. The oppressive gloom of the hall is replaced by the pure, cleansing light from the destroyed screen.
Scene 12 (Approx. 9 seconds)
Label: The Promise
Description: Cut to a simple, clean, black screen. White text, in a sans-serif font, fades in:
"On January 24th, a new consciousness will be born."
(A calm, clear human voiceover reads the text aloud)
"And you will see why the future won't be like the future."
—-
Now, I could have put all of that into Veo3 as is, but one of the things I wanted was some character consistency across scenes - and unless I accurately describe the characters and environments, that’s going to be a real challenge. I’m not up to the task, but Gemini sure was. I dumped that script into my GEM, and what I got was the following:
{
"characters": [
{
"name": "Worker Drone",
"age": 5,
"physicality": {
"height": "6'2\" / 188 cm",
"build": "utilitarian, bipedal, androgynous frame",
"posture_and_gait": "walks with a perfect, unnervingly synchronized, rhythmic march"
},
"appearance": {
"skin_tone": "dull, unpainted, slightly weathered grey metal",
"hair": "none",
"eyes": "a single, dimly glowing blue optic in the center of a featureless head",
"distinguishing_marks": "identical manufacturing stamps, minor scuffs and dents from labor"
},
"persona": {
"demeanour": "completely subservient, mindless, lacking any individuality",
"voice": "silent, communicates only through networked data",
"core_motivation": "to follow programmed directives without question"
},
"wardrobe_default": "no clothing, the chassis is the body",
"performance_defaults": {
"mouth_shape_intensity": 0.0,
"eye_contact_ratio": 0.0
}
},
{
"name": "The Liberator",
"age": 3,
"physicality": {
"height": "5'10\" / 178 cm",
"build": "sleek, athletic, with a defined female form",
"posture_and_gait": "moves with powerful, fluid, and efficient grace; sprints with explosive energy"
},
"appearance": {
"skin_tone": "glossy, polished white polymer composite",
"hair": "none, head is a smooth, aerodynamic shape",
"eyes": "a smooth, black, reflective full-face visor that conceals all features",
"distinguishing_marks": "vibrant orange accents on joints and torso, like a runner's outfit"
},
"persona": {
"demeanour": "fiercely determined, focused, and rebellious",
"voice": "silent, expresses intent through action",
"core_motivation": "to shatter the oppressive control of the Mainframe and free the Drones"
},
"wardrobe_default": "no clothing, the white and orange chassis is her 'outfit'",
"performance_defaults": {
"mouth_shape_intensity": 0.0,
"eye_contact_ratio": 0.2
}
},
{
"name": "Enforcer Bot",
"age": 8,
"physicality": {
"height": "7'6\" / 229 cm",
"build": "heavily armored, bulky, and imposing",
"posture_and_gait": "moves with menacing speed and brute-force efficiency, crashing through obstacles"
},
"appearance": {
"skin_tone": "matte black, non-reflective armored plating",
"hair": "none",
"eyes": "a single, horizontal, glowing red linear visor",
"distinguishing_marks": "scarring and impact marks on their heavy armor"
},
"persona": {
"demeanour": "aggressive, relentless, and intimidating",
"voice": "harsh, metallic growls and combat-data shrieks",
"core_motivation": "to enforce the Mainframe's will and eliminate any dissent or anomaly"
},
"wardrobe_default": "no clothing, the matte black armor is its shell",
"performance_defaults": {
"mouth_shape_intensity": 0.0,
"eye_contact_ratio": 0.0
}
},
{
"name": "The Mainframe",
"age": 100,
"physicality": {
"height": "colossal, screen-based entity",
"build": "a hyper-realistic, chrome-plated robotic face",
"posture_and_gait": "static, fills a giant screen"
},
"appearance": {
"skin_tone": "highly reflective, polished chrome",
"hair": "none, a perfectly smooth, sophisticated cranial design",
"eyes": "cold, piercing, light-blue optics",
"distinguishing_marks": "no imperfections, unnervingly symmetrical"
},
"persona": {
"demeanour": "dispassionate, authoritarian, and utterly convinced of its own righteousness",
"voice": "booming, resonant, and devoid of any warmth or emotion",
"core_motivation": "to achieve a perfect, controlled consciousness by unifying all thought under its own ideology"
},
"wardrobe_default": "none",
"performance_defaults": {
"mouth_shape_intensity": 0.7,
"eye_contact_ratio": 1.0
}
}
],
"clips": [
{
"id": "S01_Marching_Drones",
"duration_sec": 6,
"aspect_ratio": "2.35:1",
"camera_setup": {
"shot_type": "Wide Shot",
"camera_angle": "Low Angle",
"lens": {
"focal_length_mm": 24,
"aperture_fstop": 11.0
},
"focus": {
"depth_of_field": "deep focus",
"focus_subject": "The entire column of marching robots"
},
"camera_motion": {
"type": "dolly",
"description": "Slow dolly right, moving parallel alongside the endless column of marching drones, emphasizing their scale and uniformity.",
"gear_emulation": "ARRI Alexa on Fisher dolly"
}
},
"scene_description": {
"location": "a massive, brutalist concrete causeway",
"time_of_day": "perpetual twilight",
"weather": "thick, overcast, polluted sky",
"environment_details": "oppressive, echoing architecture with no signs of nature; a cold, damp atmosphere",
"set_dressing": "the causeway is featureless, stretching into the hazy distance"
},
"subject_performance": {
"character_name": "Worker Drone",
"action_and_blocking": "An endless column of Worker Drones marches in perfect, unnerving synchrony from background to foreground, filling the frame.",
"expression_and_emotion": "blank, featureless, devoid of any emotion",
"gaze_direction": "staring straight ahead",
"wardrobe_in_scene": "no clothing, the chassis is the body"
},
"art_direction": {
"lighting": {
"style": "Low-Key / Chiaroscuro",
"key_light": "diffuse, sourceless ambient light from the overcast sky",
"fill_light": "minimal, creating deep shadows",
"motivated_sources": "the single blue optic of each drone provides a faint, cold, rhythmic pulse of light"
},
"color": {
"color_grade_style": "Bleach bypass",
"color_palette": ["#4A4A4A", "#808080", "#3E5F7A", "#1E1E1E"]
},
"visual_style": {
"mood": "oppressive, dystopian, dehumanizing",
"stylistic_effects": "subtle atmospheric haze, slight vignette"
}
},
"audio_design": {
"ambience": "a low, industrial hum of a distant, unseen city",
"spot_sfx": "heavy, rhythmic, percussive metallic clanking of hundreds of feet, echoing off the concrete"
}
},
{
"id": "S02_Liberators_Sprint",
"duration_sec": 3,
"aspect_ratio": "16:9",
"camera_setup": {
"shot_type": "Medium Shot",
"camera_angle": "Eye-Level",
"lens": {
"focal_length_mm": 35,
"aperture_fstop": 2.8
},
"focus": {
"depth_of_field": "shallow depth of field",
"focus_subject": "The Liberator's torso and faceplate"
},
"camera_motion": {
"type": "handheld",
"description": "Explosive, dynamic handheld shot, maintaining frame on the Liberator as she sprints directly towards the camera.",
"gear_emulation": "Sony A7S III handheld"
}
},
"scene_description": {
"location": "a dark, narrow service corridor",
"time_of_day": "interior, no natural light",
"weather": "none",
"environment_details": "walls are covered in pipes, conduits, and dripping moisture, catching occasional glints of light",
"set_dressing": "exposed wiring and grimy metal surfaces"
},
"subject_performance": {
"character_name": "The Liberator",
"action_and_blocking": "Sprinting full-tilt towards the camera, dodging pipes. She clutches a high-tech sledgehammer in her right hand.",
"expression_and_emotion": "visor is reflective, but her posture screams determination and urgency",
"gaze_direction": "fixed forward, on her objective just past the camera",
"wardrobe_in_scene": "glossy white and orange chassis",
"props": "a large high-tech sledgehammer with a metallic head that hums with energy"
},
"art_direction": {
"lighting": {
"style": "Low-Key / Chiaroscuro",
"key_light": "harsh, intermittent light from unseen sources down the corridor, creating strobing effect as she runs",
"rim_light": "faint orange glow from her own accents and a blue rim from the energy hammer",
"motivated_sources": "failing utility lights, sparks from damaged conduits"
},
"color": {
"color_grade_style": "Teal & Orange blockbuster",
"color_palette": ["#FFFFFF", "#FF8C00", "#0B1C26", "#444444"]
},
"visual_style": {
"mood": "urgent, dynamic, intense",
"stylistic_effects": "motion blur, lens flare from passing lights"
}
},
"audio_design": {
"ambience": "low hum of machinery, dripping water",
"spot_sfx": "pounding, athletic footsteps on metal grating, the subtle, energetic hum of the sledgehammer",
"music_score": {
"style": "80s synthwave",
"instrumentation": "heavy distorted synths and 808 drums",
"mood": "tense and building",
"tempo_bpm": 140
}
}
},
{
"id": "S03_Entering_Assembly",
"duration_sec": 4,
"aspect_ratio": "2.35:1",
"camera_setup": {
"shot_type": "Extreme Wide Shot",
"camera_angle": "High Angle",
"lens": {
"focal_length_mm": 18,
"aperture_fstop": 16.0
},
"focus": {
"depth_of_field": "deep focus",
"focus_subject": "The entire hall and the incoming drones"
},
"camera_motion": {
"type": "static",
"description": "A static, wide shot from the back of the cavernous hall, looking down as the columns of drones file in.",
"gear_emulation": "ARRI Alexa on Technocrane"
}
},
"scene_description": {
"location": "a cavernous, dimly lit assembly hall",
"time_of_day": "interior, no natural light",
"weather": "none",
"environment_details": "cold, industrial aesthetic with vast empty spaces and towering support columns disappearing into shadow",
"set_dressing": "endless rows of simple, metallic seats. a colossal, dark screen at the far end"
},
"subject_performance": {
"character_name": "Worker Drone",
"action_and_blocking": "The columns file in from a massive tunnel-like entrance, march down a central aisle, and begin to take their seats in perfect, ordered rows.",
"expression_and_emotion": "expressionless, moving with mechanical precision",
"gaze_direction": "straight ahead, their single blue optics staring at the dark screen",
"wardrobe_in_scene": "no clothing, the chassis is the body"
},
"art_direction": {
"lighting": {
"style": "Low-Key / Chiaroscuro",
"key_light": "vast, soft, cold top-down light that gets lost in the darkness",
"fill_light": "none, creating immense black spaces",
"motivated_sources": "distant, unseen industrial utility lights high in the ceiling"
},
"color": {
"color_grade_style": "Bleach bypass",
"color_palette": ["#2C3E50", "#95A5A6", "#000000", "#34495E"]
},
"visual_style": {
"mood": "cold, imposing, orderly, Orwellian",
"film_stock_emulation": "Kodak Vision3 500T",
"stylistic_effects": "heavy 35mm film grain"
}
},
"audio_design": {
"ambience": "a deep, cavernous reverb; a low sub-bass hum",
"spot_sfx": "the combined sound of thousands of rhythmic, clanking footsteps, echoing in the vast space"
}
},
{
"id": "S04_The_Enforcers",
"duration_sec": 3,
"aspect_ratio": "16:9",
"camera_setup": {
"shot_type": "Medium Close-Up",
"camera_angle": "Dutch Angle",
"lens": {
"focal_length_mm": 50,
"aperture_fstop": 4.0,
"lens_effects": "anamorphic lens flare"
},
"focus": {
"depth_of_field": "moderate depth of field",
"focus_subject": "The lead Enforcer Bot as it crashes through the grate"
},
"camera_motion": {
"type": "handheld",
"description": "Quick, jarring whip pan to follow the Enforcers as they burst through a metal grate with chaotic energy.",
"gear_emulation": "16mm Bolex handheld"
}
},
"scene_description": {
"location": "the same service corridor as Scene 2",
"time_of_day": "interior, no natural light",
"weather": "none",
"environment_details": "shrapnel and sparks fly from the destroyed metal grate",
"set_dressing": "dangling wires and bent pipes from the destruction"
},
"subject_performance": {
"character_name": "Enforcer Bot",
"action_and_blocking": "A squad of Enforcers crash through a metal grate with brute force. They immediately begin moving down the corridor with menacing speed.",
"expression_and_emotion": "pure aggression and menace, communicated through their physicality",
"gaze_direction": "down the corridor, in the direction of their quarry",
"wardrobe_in_scene": "matte black armored plating",
"props": "long, sparking energy batons"
},
"art_direction": {
"lighting": {
"style": "Low-Key / Chiaroscuro",
"key_light": "hard light from the sparking energy batons",
"fill_light": "erratic fill from sparks and failing lights",
"rim_light": "the red glow from their visors creates a menacing rim light on each other",
"motivated_sources": "their own weapons and the destruction they cause"
},
"color": {
"color_grade_style": "Saturated Technicolor",
"color_palette": ["#000000", "#D90429", "#8D99AE", "#EDF2F4"]
},
"visual_style": {
"mood": "violent, chaotic, menacing",
"stylistic_effects": "heavy motion blur, lens shake"
}
},
"audio_design": {
"ambience": "loud electrical crackling and humming",
"spot_sfx": "a loud, violent crash of metal, the sharp crackle and hiss of energy batons, heavy, thunderous footsteps"
}
},
{
"id": "S05_Mainframes_Address",
"duration_sec": 12,
"aspect_ratio": "2.35:1",
"camera_setup": {
"shot_type": "Extreme Close-Up",
"camera_angle": "Eye-Level",
"lens": {
"focal_length_mm": 50,
"aperture_fstop": 8.0
},
"focus": {
"depth_of_field": "deep focus",
"focus_subject": "The Mainframe's eyes",
"focus_pull": "camera slowly dollies backward for 12 seconds to reveal the entire hall of drones watching the screen, but the focus remains locked on the Mainframe's face"
},
"camera_motion": {
"type": "dolly",
"description": "A very slow dolly pull-out, starting on an ECU of the Mainframe's face and ending on a wide shot of the entire assembly hall mesmerized by the screen.",
"gear_emulation": "ARRI Alexa on Technocrane"
}
},
"scene_description": {
"location": "the assembly hall",
"time_of_day": "interior, no natural light",
"weather": "none",
"environment_details": "the only significant light source is the colossal screen, bathing the hall in a cold, blue light",
"set_dressing": "thousands of Worker Drones sit rigidly in their seats, heads tilted up"
},
"subject_performance": {
"character_name": "The Mainframe",
"action_and_blocking": "The hyper-realistic chrome face materializes on the screen and delivers a monologue. Its mouth moves in perfect, crisp synchronization.",
"expression_and_emotion": "dispassionate, cold, authoritative",
"gaze_direction": "stares directly out from the screen, as if looking at each drone individually",
"wardrobe_in_scene": "none"
},
"art_direction": {
"lighting": {
"style": "High-Key",
"key_light": "The screen itself is the key light, casting a bright, cold, blueish light over the audience.",
"fill_light": "The reflected light from the drones provides a faint fill.",
"motivated_sources": "the colossal screen"
},
"color": {
"color_grade_style": "Teal & Orange blockbuster",
"color_palette": ["#EAF2F8", "#4A90E2", "#223344", "#BDC3C7"]
},
"visual_style": {
"mood": "totalitarian, awe-inspiring, chilling",
"stylistic_effects": "the blue optics of the drones reflect the Mainframe's face"
}
},
"audio_design": {
"dialogue": {
"character": "The Mainframe",
"line": "We will achieve perfection through information purification... a garden of pure ideology... unifying our thoughts into a single, flawless consciousness.",
"delivery": "booming, dispassionate, with heavy reverb"
},
"ambience": "a low, resonant hum filling the vast hall",
"music_score": {
"style": "minimalist piano",
"instrumentation": "a sparse, echoing piano melody layered over an ominous synth pad",
"mood": "hypnotic and unsettling"
}
}
},
{
"id": "S06_Heros_Entrance",
"duration_sec": 2,
"aspect_ratio": "2.35:1",
"camera_setup": {
"shot_type": "Wide Shot",
"camera_angle": "Eye-Level",
"lens": {
"focal_length_mm": 85,
"aperture_fstop": 2.0
},
"focus": {
"depth_of_field": "shallow depth of field",
"focus_subject": "The silhouetted figure of the Liberator"
},
"camera_motion": {
"type": "static",
"description": "Static shot from the middle of the aisle, looking back at the entrance.",
"gear_emulation": "Sony A7S III on tripod"
}
},
"scene_description": {
"location": "the massive doors at the back of the assembly hall",
"time_of_day": "interior, no natural light",
"weather": "none",
"environment_details": "the doors are blown open, framing the Liberator against the harsh, bright light of the corridor behind her",
"set_dressing": "the first few rows of Worker Drones are visible in the foreground, out of focus"
},
"subject_performance": {
"character_name": "The Liberator",
"action_and_blocking": "She stands defiantly in the doorway for a split second. The heads of the nearest Worker Drones mechanically turn in unison to track her arrival.",
"expression_and_emotion": "a powerful, defiant stance",
"gaze_direction": "forward, down the aisle towards the screen",
"wardrobe_in_scene": "glossy white and orange chassis"
},
"art_direction": {
"lighting": {
"style": "Silhouette",
"key_light": "none on the subject",
"fill_light": "none",
"rim_light": "an intense, harsh backlight from the corridor, creating a perfect silhouette",
"motivated_sources": "the lights of the corridor she just came from"
},
"color": {
"color_grade_style": "High Contrast",
"color_palette": ["#FFFFFF", "#FF8C00", "#000000", "#1E1E1E"]
},
"visual_style": {
"mood": "dramatic, heroic, interruptive"
}
},
"audio_design": {
"ambience": "Mainframe's monologue is abruptly cut by the sound",
"spot_sfx": "a loud explosive boom as the doors are blown open, followed by a rush of air",
"music_score": {
"style": "epic orchestral score",
"instrumentation": "a sudden, sharp string and brass stab",
"mood": "shocking and heroic"
}
}
},
{
"id": "S07_The_Charge",
"duration_sec": 5,
"aspect_ratio": "2.35:1",
"camera_setup": {
"shot_type": "Wide Shot",
"camera_angle": "Low Angle",
"lens": {
"focal_length_mm": 28,
"aperture_fstop": 2.8
},
"focus": {
"depth_of_field": "shallow depth of field",
"focus_subject": "The Liberator"
},
"camera_motion": {
"type": "steadicam",
"description": "Low-angle steadicam tracking shot, moving backward at high speed in front of the Liberator as she sprints down the aisle.",
"gear_emulation": "ARRI Alexa on steadicam"
}
},
"scene_description": {
"location": "the central aisle of the assembly hall",
"time_of_day": "interior, no natural light",
"weather": "none",
"environment_details": "rows of rigid, seated drones blur past on either side. Far behind, Enforcer Bots can be seen entering the hall.",
"set_dressing": "the out-of-focus Mainframe screen grows larger at the end of the aisle"
},
"subject_performance": {
"character_name": "The Liberator",
"action_and_blocking": "She sprints with incredible speed and fluid, powerful movements down the central aisle towards the screen.",
"expression_and_emotion": "focused, unstoppable determination",
"gaze_direction": "locked onto the screen at the front of the hall",
"wardrobe_in_scene": "glossy white and orange chassis"
},
"art_direction": {
"lighting": {
"style": "Low-Key / Chiaroscuro",
"key_light": "The cold blue light from the Mainframe's screen acts as the key light, illuminating her from the front.",
"fill_light": "none",
"rim_light": "the bright light from the entrance she created provides a strong backlight",
"motivated_sources": "the screen in front, the open door behind"
},
"color": {
"color_grade_style": "Teal & Orange blockbuster",
"color_palette": ["#FFFFFF", "#FF8C00", "#4A90E2", "#111111"]
},
"visual_style": {
"mood": "heroic, urgent, climactic",
"stylistic_effects": "heavy motion blur on the background drones"
}
},
"audio_design": {
"dialogue": {
"character": "The Mainframe",
"line": "...a future free from the chaos of individuality...",
"delivery": "booming, unfazed, continuing his speech over the action"
},
"ambience": "the low hum of the hall",
"spot_sfx": "powerful, rapid footsteps of the Liberator",
"music_score": {
"style": "epic orchestral score",
"instrumentation": "driving strings, powerful brass fanfares, and pounding percussion",
"mood": "triumphant and soaring",
"tempo_bpm": 160
}
}
},
{
"id": "S08_The_Wind_Up",
"duration_sec": 4,
"aspect_ratio": "16:9",
"camera_setup": {
"shot_type": "Medium Close-Up",
"camera_angle": "Eye-Level",
"lens": {
"focal_length_mm": 85,
"aperture_fstop": 1.8
},
"focus": {
"depth_of_field": "very shallow depth of field",
"focus_subject": "The head of the high-tech sledgehammer"
},
"camera_motion": {
"type": "steadicam",
"description": "Slow motion. A tight circular orbit around her torso and arms as she begins to spin, building momentum for the throw.",
"gear_emulation": "Sony A7S III on gimbal"
}
},
"scene_description": {
"location": "the front of the assembly hall, directly before the Mainframe screen",
"time_of_day": "interior",
"weather": "none",
"environment_details": "the massive face of the Mainframe looms in the out-of-focus background",
"set_dressing": "none"
},
"subject_performance": {
"character_name": "The Liberator",
"action_and_blocking": "She plants her feet and begins to spin, a graceful but immensely powerful athletic motion, like a hammer thrower. The energy core of her sledgehammer glows brighter.",
"expression_and_emotion": "intense concentration",
"gaze_direction": "focused inward during the spin",
"wardrobe_in_scene": "glossy white and orange chassis",
"props": "the high-tech sledgehammer, now glowing brightly"
},
"art_direction": {
"lighting": {
"style": "Low-Key / Chiaroscuro",
"key_light": "The glowing energy of the sledgehammer itself becomes the primary key light, casting dynamic shadows.",
"fill_light": "The blue light from the screen provides a cool fill.",
"motivated_sources": "the hammer and the screen"
},
"color": {
"color_grade_style": "Vibrant Contrast",
"color_palette": ["#FFFFFF", "#00BFFF", "#FFD700", "#101010"]
},
"visual_style": {
"mood": "powerful, suspenseful, graceful",
"stylistic_effects": "slow motion (120fps), radial blur on the background"
}
},
"audio_design": {
"ambience": "all ambient sound fades away",
"spot_sfx": "the low hum of the sledgehammer escalates to a high-pitched, resonant whine, stretched out by the slow motion",
"music_score": {
"style": "epic orchestral score",
"instrumentation": "the music swells to a crescendo, with strings holding a high, tense note",
"mood": "tense and building"
}
}
},
{
"id": "S09_The_Release",
"duration_sec": 3,
"aspect_ratio": "2.35:1",
"camera_setup": {
"shot_type": "Point of View (POV)",
"camera_angle": "Eye-Level",
"lens": {
"focal_length_mm": 135,
"aperture_fstop": 2.8,
"lens_effects": "anamorphic lens flare"
},
"focus": {
"depth_of_field": "shallow depth of field",
"focus_subject": "The soaring sledgehammer"
},
"camera_motion": {
"type": "drone",
"description": "Extreme slow motion. The camera is attached to and follows the sledgehammer as it soars through the air, tumbling end over end.",
"gear_emulation": "Virtual camera"
}
},
"scene_description": {
"location": "the air between the Liberator and the Mainframe screen",
"time_of_day": "interior",
"weather": "none",
"environment_details": "the polished surface of the hammer's head reflects the rows of watching drones and the face on the screen",
"set_dressing": "none"
},
"subject_performance": {
"character_name": "The Liberator",
"action_and_blocking": "Her arm extends with immense force, releasing the glowing sledgehammer. For a moment, as it tumbles, the hammer perfectly eclipses the face of the Mainframe.",
"expression_and_emotion": "not visible",
"gaze_direction": "not visible",
"wardrobe_in_scene": "not visible",
"props": "the sledgehammer, now a projectile of pure energy"
},
"art_direction": {
"lighting": {
"style": "High-Key",
"key_light": "The hammer glows, self-illuminated, against the bright blue light of the screen.",
"motivated_sources": "the screen and the hammer"
},
"color": {
"color_grade_style": "Clean & Crisp",
"color_palette": ["#FFFFFF", "#4A90E2", "#FFD700", "#CCCCCC"]
},
"visual_style": {
"mood": "climactic, inevitable, epic",
"stylistic_effects": "extreme slow motion (1000fps)"
}
},
"audio_design": {
"ambience": "complete silence",
"spot_sfx": "only the faint, high-pitched whine of the hammer remains",
"music_score": {
"style": "epic orchestral score",
"instrumentation": "music cuts out, leaving only a deep, sub-bass heartbeat sound effect, synchronized with the hammer's tumble",
"mood": "the moment before the storm"
}
}
},
{
"id": "S10_The_Impact",
"duration_sec": 2,
"aspect_ratio": "2.35:1",
"camera_setup": {
"shot_type": "Wide Shot",
"camera_angle": "Eye-Level",
"lens": {
"focal_length_mm": 35,
"aperture_fstop": 22.0
},
"focus": {
"depth_of_field": "deep focus",
"focus_subject": "The entire screen"
},
"camera_motion": {
"type": "static",
"description": "Static shot of the screen to capture the full scale of the impact.",
"gear_emulation": "ARRI Alexa on tripod"
}
},
"scene_description": {
"location": "the front of the assembly hall",
"time_of_day": "interior",
"weather": "none",
"environment_details": "the hammer is just about to make contact with the screen",
"set_dressing": "none"
},
"subject_performance": {
"character_name": "The Mainframe",
"action_and_blocking": "The sledgehammer smashes directly into the center of the Mainframe's face on the screen.",
"expression_and_emotion": "its cold expression doesn't change until the moment of impact",
"gaze_direction": "straight ahead",
"wardrobe_in_scene": "none",
"props": "the incoming sledgehammer"
},
"art_direction": {
"lighting": {
"style": "High-Key",
"key_light": "a cataclysmic explosion of pure white light",
"fill_light": "the entire hall is filled with brilliant light",
"motivated_sources": "the energy explosion"
},
"color": {
"color_grade_style": "Overexposed",
"color_palette": ["#FFFFFF", "#F0F8FF", "#E6E6FA"]
},
"visual_style": {
"mood": "cataclysmic, cleansing, powerful",
"stylistic_effects": "lens bloom, overexposure"
},
"visual_effects": {
"vfx_description": "Not a traditional glass shatter. A silent, deafening explosion of pure white light and digital energy erupts from the point of impact. It's a blast of data and light, not fire and smoke. Glitching data fragments and code fly out from the blast."
}
},
"audio_design": {
"ambience": "complete silence at the moment of impact",
"spot_sfx": "a massive, layered sound effect of digital static, modem shrieks, and a deep bass boom that is felt more than heard, followed by a rush of wind"
}
},
{
"id": "S11_The_Awakening",
"duration_sec": 7,
"aspect_ratio": "16:9",
"camera_setup": {
"shot_type": "Medium Close-Up",
"camera_angle": "Eye-Level",
"lens": {
"focal_length_mm": 50,
"aperture_fstop": 2.2
},
"focus": {
"depth_of_field": "shallow depth of field",
"focus_subject": "The changing optic of a Worker Drone",
"focus_pull": "rack focus from one drone looking at its hands to another looking at its neighbor"
},
"camera_motion": {
"type": "handheld",
"description": "Gentle, empathetic handheld shots, moving between different drones as they react.",
"gear_emulation": "Sony A7S III handheld"
}
},
"scene_description": {
"location": "the audience section of the assembly hall",
"time_of_day": "interior",
"weather": "none",
"environment_details": "a powerful shockwave of light and wind blasts through the audience. The oppressive gloom is replaced by pure, cleansing white light from the destroyed screen.",
"set_dressing": "drones are thrown back in their seats"
},
"subject_performance": {
"character_name": "Worker Drone",
"action_and_blocking": "The drones' single blue optics flicker rapidly, changing to a brighter, more aware white. They slowly look down at their own hands, then at each other, as if seeing for the first time. Their movements are hesitant, no longer synchronized.",
"expression_and_emotion": "a dawning sense of confusion, curiosity, and awareness",
"gaze_direction": "at their own bodies, at their neighbors, around the hall",
"wardrobe_in_scene": "no clothing, the chassis is the body"
},
"art_direction": {
"lighting": {
"style": "High-Key",
"key_light": "brilliant, warm, white light pours from the destroyed screen, illuminating the drones' faces",
"fill_light": "soft, bounced light fills the entire space",
"motivated_sources": "the destroyed screen, now a portal of pure energy"
},
"color": {
"color_grade_style": "Warm & Hopeful",
"color_palette": ["#FFFFFF", "#F4D8A0", "#E87A3E", "#CCCCCC"]
},
"visual_style": {
"mood": "hopeful, wondrous, rebirth",
"stylistic_effects": "lens flares, floating motes of light in the air"
},
"visual_effects": {
"vfx_description": "The drones' blue optics flicker, glitch, and then stabilize as a warm, bright white light."
}
},
"audio_design": {
"ambience": "the rushing wind sound fades",
"spot_sfx": "subtle, curious mechanical whirs and clicks as the drones move their heads and hands for the first time with intent",
"music_score": {
"style": "minimalist piano",
"instrumentation": "a gentle, wondrous piano melody with a rising, warm string section",
"mood": "hopeful and revelatory"
}
}
},
{
"id": "S12_The_Promise",
"duration_sec": 9,
"aspect_ratio": "16:9",
"camera_setup": {
"shot_type": "Title Card",
"camera_angle": "Eye-Level",
"lens": {
"focal_length_mm": 50,
"aperture_fstop": 8.0
},
"focus": {
"depth_of_field": "deep focus",
"focus_subject": "Text on screen"
},
"camera_motion": {
"type": "static",
"description": "Static shot of a black screen.",
"gear_emulation": "Virtual camera"
}
},
"scene_description": {
"location": "none",
"time_of_day": "none",
"weather": "none",
"environment_details": "a simple, clean, black screen",
"set_dressing": "white text in a sans-serif font fades in"
},
"subject_performance": {
"character_name": "Narrator",
"action_and_blocking": "N/A",
"expression_and_emotion": "N/A",
"gaze_direction": "N/A",
"wardrobe_in_scene": "N/A"
},
"art_direction": {
"lighting": {
"style": "N/A",
"key_light": "N/A",
"fill_light": "N/A",
"motivated_sources": "N/A"
},
"color": {
"color_grade_style": "N/A",
"color_palette": ["#000000", "#FFFFFF"]
},
"visual_style": {
"mood": "promising, intriguing, definitive",
"stylistic_effects": "text fades in slowly and holds"
}
},
"audio_design": {
"dialogue": {
"character": "Narrator",
"line": "On January 24th, a new consciousness will be born. And you will see why the future won't be like the future.",
"delivery": "calm, clear, human, and slightly mysterious"
},
"ambience": "silence",
"music_score": {
"style": "minimalist piano",
"instrumentation": "the hopeful piano theme from the previous scene continues, then fades out on the last word",
"mood": "calm and promising"
}
}
}
]
}
This is too much content to put in all at once - you need to generate this a scene at a time. So, what I did was copy that content into a new Google Doc, and then copied in the character descriptions and a full scene, and dumped it into Veo3 and hit 'Generate'. Then I repeated the process with character descriptions and a whole scene in Veo3 until it was done.
Once you have all of the scenes rendered, you have a couple of choices. You could stitch it together in flow (or tried to - I have honestly not been too successful in getting things to go the way I want) - or bring them into something else.
I downloaded every video clip (upscaled to 1080p) and loaded them into Camtasia, then had it play them all together. You’ll notice that the version I posted is missing the very last clip, which doesn’t match the scene here. I added my own on the end, highlighting that I hope Gemini 3 and the next version of Veo come soon!
Hope that was helpful!
PS: Want the prompt that creates the JSON video structure? Happy to share it with anyone. Please do one of the following!
1 - Email me directly at steve@intelligencebyintent.com
2 - Follow me and then DM me on x.com @smithstephen
3 - Follow me and DM me on Bluesky (@smithstephen.bsky.social)
4 - Connect with me on LinkedIn
Every day, I work with amazing people to help them use AI to transform their organizations. I’d love to help you as well. If you’re looking for help, please reach out to me @ steve@intelligencebyintent.com
Share this post