Secrets AI Video Generator: What It Is, What It Costs, Whether It's Worth It
Video generation, the ability to turn a static AI companion image into a short motion clip, is something no other platform in the mainstream AI companion market does as well as Secrets AI. It is rare enough that it functions as a genuine reason to choose Secrets AI over alternatives. Character.AI doesn't have it. CrushOn AI doesn't have it. Janitor AI doesn't have it. Candy AI has a limited version of it.
The question isn't whether the feature is unique. The question is whether the Moments cost makes it worthwhile for your usage pattern. That's what this page answers.
What the Feature Actually Does
The Secrets AI video generator takes an existing AI companion image and renders a short motion video clip from it based on a text prompt you provide. The output is a video of your companion moving — expressions, body motion, scene context — derived from the source image and shaped by your prompt.
This is generated media, not a live interaction. The platform uses AI-based video synthesis (applying deep learning techniques to generate realistic motion from static images) to create clips that typically take about 2 minutes to render. The system draws on both your character's visual profile and your current conversation scenario when processing the prompt.
The technology is impressive given the price point. The outputs are visibly AI-generated to experienced eyes but genuinely realistic in motion and expression quality for a generated format. The quality rating from aigirlfriendscout is 4.1/5, which in practice means the feature works well for standard prompts, shows occasional quality variance on complex requests, and gives its best results with specific, simple motion descriptions.
Who Has This Feature (And Who Doesn't)
This is worth stating directly because it affects the decision of whether to choose Secrets AI over alternatives:
Platforms with no video generation:
- Character.AI: no video
- CrushOn AI: no video
- Janitor AI: no video
- Replika: no video
- GirlfriendGPT — no video
Platforms with limited video:
- Candy AI — limited video capability, less developed than Secrets AI
Platforms with comparable video:
- SweetDream AI — comparable feature set
- Xotic AI — 4K resolution, up to 15-second clips
Among accessible mainstream platforms in Secrets AI's price range ($5.99-$39.99/month), video generation is close to a Secrets AI-specific capability: only SweetDream AI and Xotic AI are listed as comparable, and Candy AI's version is limited. For users who specifically want video from their AI companion, this feature alone narrows the viable options considerably.
For the broader platform comparison, see the alternatives page.
How to Create a Video: The Process
Step 1: Have a source image.
Video generation starts from an existing companion image. If you're generating video for the first time, you'll need to generate a source image first (25-50 Moments). The quality of the source image matters — sharper, cleaner source images produce better video output. If you're on the Premium generation model, use it for the source image you intend to animate.
Step 2: Write a motion prompt.
Describe what you want to happen in the video. Effective prompts are specific about the motion type but focused on a single continuous action. Examples:
Good: "slowly turning her head toward the camera and smiling"
Good: "gentle swaying movement, hair moving naturally in breeze"
Less effective: "dancing and then changing outfit and then looking surprised"
The AI generates more coherent results from single, continuous motion descriptions than from multi-step sequences.
Step 3: Submit and wait.
Processing takes approximately 2 minutes. This is consistent regardless of clip length. Don't navigate away from the page during generation.
Step 4: Review the output.
The rendered clip appears in your interface. If the quality meets your expectations, save it. If not, refine your prompt and try again — but remember that each attempt costs Moments.
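The single-motion guidance in Step 2 can be turned into a quick check before you spend Moments. The sketch below is a hypothetical helper, not part of Secrets AI: it only flags a few wording patterns (such as "and then") that usually signal a multi-step sequence, and the marker list is an assumption you may want to adjust.

```python
import re

# Hypothetical list of phrases that usually signal a multi-step sequence.
# This is a rough wording heuristic, not anything the platform itself provides.
SEQUENCE_MARKERS = re.compile(
    r"\b(?:and then|then|after that|afterwards|followed by)\b",
    re.IGNORECASE,
)


def looks_like_sequence(prompt: str) -> bool:
    """Return True if the prompt wording suggests several actions in a row."""
    return bool(SEQUENCE_MARKERS.search(prompt))


if __name__ == "__main__":
    examples = [
        "slowly turning her head toward the camera and smiling",
        "gentle swaying movement, hair moving naturally in breeze",
        "dancing and then changing outfit and then looking surprised",
    ]
    for prompt in examples:
        verdict = "sequence, consider simplifying" if looks_like_sequence(prompt) else "single motion"
        print(f"{verdict}: {prompt}")
```

A check like this cannot judge whether a prompt will render well. It only catches the most obvious multi-step phrasing before an attempt is paid for.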
Moments Cost: The Full Breakdown
Video is the most Moments-expensive feature on the platform. Here is the complete cost structure:
- Short clip (3 seconds, Lite tier): approximately 50 Moments
- Standard video clip: approximately 300 Moments
- Full/longer video clip: approximately 600 Moments
For comparison, 600 Moments represents:
- 12-24 images (at 25-50 Moments each)
- 6 minutes of voice calls (at 100 Moments/minute)
- 300-600 text messages (at 1-2 Moments each)
One long video clip consumes the equivalent of 6 minutes of voice or 12 to 24 images. This is the trade-off you're making each time you generate a long video.
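The comparison above is plain integer division, and the same arithmetic works for any Moments amount. The sketch below uses only the approximate per-unit costs quoted in this article. The function and constant names are illustrative, and actual platform pricing may differ.

```python
# Approximate per-unit costs quoted in this article, in Moments.
LONG_CLIP = 600
IMAGE_COST = (25, 50)     # cheapest and most expensive image
VOICE_PER_MINUTE = 100
TEXT_COST = (1, 2)        # cheapest and most expensive text message


def equivalents(moments: int) -> dict:
    """Return what a Moments amount buys in other features.

    Ranged features are returned as (fewest, most) whole units.
    """
    return {
        "images": (moments // IMAGE_COST[1], moments // IMAGE_COST[0]),
        "voice_minutes": moments // VOICE_PER_MINUTE,
        "text_messages": (moments // TEXT_COST[1], moments // TEXT_COST[0]),
    }


if __name__ == "__main__":
    for feature, amount in equivalents(LONG_CLIP).items():
        print(f"{feature}: {amount}")
```

Passing 300 instead of 600 gives the same comparison for a standard clip.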
Monthly video volume by tier:
- Lite (1,000 Moments): budget for 1 long video (1,000 / 600 is about 1.67, so a second one does not fit) or ~20 short clips per month, but this leaves little or nothing for other media
- Plus (3,000 Moments): 5 long videos or ~60 short clips, but only if you spend essentially all of your Moments on video
- Premium (8,000 Moments): ~13 long videos with Moments to spare for other features
- Ultimate (15,000 Moments): ~25 long videos if the whole allocation goes to video, or fewer with significant budget left for mixed media
The practical conclusion: If you want to generate up to 5 long video clips per month without constantly buying top-ups, Plus is the minimum viable tier, and reaching 5 uses the entire Plus allocation. For 10+ long clips, Premium. For 20+ long clips or mixed heavy media use, Ultimate.
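The per-tier figures come from dividing each monthly allocation by the clip cost and rounding down to whole clips, which is why Lite works out to 1 long clip (1,000 / 600 is about 1.67). The sketch below uses the approximate costs and allocations from this article. The `reserve` parameter is an illustrative addition for Moments you want to keep for other features, and the code ignores any tier limits on clip length.

```python
# Approximate figures quoted in this article, in Moments.
CLIP_COSTS = {"short": 50, "standard": 300, "long": 600}
TIER_MOMENTS = {"Lite": 1_000, "Plus": 3_000, "Premium": 8_000, "Ultimate": 15_000}


def clips_per_month(tier: str, clip: str, reserve: int = 0) -> int:
    """Whole clips affordable in a month after setting `reserve` Moments aside."""
    budget = TIER_MOMENTS[tier] - reserve
    if budget < 0:
        raise ValueError("reserve exceeds the monthly allocation")
    return budget // CLIP_COSTS[clip]


if __name__ == "__main__":
    for tier in TIER_MOMENTS:
        long_clips = clips_per_month(tier, "long")
        short_clips = clips_per_month(tier, "short")
        print(f"{tier}: {long_clips} long or {short_clips} short clips")
```

Setting `reserve` shows how quickly the video count drops once other media is budgeted. For example, `clips_per_month("Plus", "long", reserve=1_000)` returns 3.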
What Quality Actually Looks Like
The 4.1/5 quality rating translates to this in practical terms:
For prompts involving simple, continuous natural movements — a character turning, smiling, subtle body sway, breathing animation, glancing around — the outputs look natural and the motion is smooth. Expressions sync with the prompt direction. The visual quality is derived from the source image quality and rendered with realistic motion physics.
For prompts involving complex sequences — multiple actions, scene context changes, complex choreography — quality variance increases. The AI may interpret the prompt partially or produce output that's coherent but not exactly what was requested.
The most common quality issue: abrupt motion start or end rather than smooth natural movement. The clips are short enough that this is sometimes noticeable. Starting prompts with "gently" or "slowly" tends to produce smoother transitions.
Quality is consistently better on the Premium generation model than on the standard model. If you're generating a video clip from a specific image you care about, use the Premium model for the source image.
Source image quality is the single largest variable in output quality. A high-quality, well-rendered source image produces better video than a lower-quality one, even with the same prompt.
Tips for Getting More From Every Moments Spend
Start with a short clip before committing to a long one. The short clip costs ~50 Moments versus ~600 for a long clip. Test your prompt on a short clip first. If the motion looks right, then generate the long version. Each prompt that fails at the short-clip stage costs ~50 Moments instead of ~600, a saving of ~550 per revision. The price is an extra ~50 Moments for the test when your prompt works on the first try.
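A rough way to compare the two approaches is to count how many prompt revisions you expect to need. The sketch below assumes the ~50 and ~600 Moments costs quoted in this article, and it also assumes that a prompt which looks right on a short clip will work on the long one, which is not guaranteed. The function names are illustrative.

```python
SHORT_CLIP = 50   # approximate Moments cost quoted in this article
LONG_CLIP = 600   # approximate Moments cost quoted in this article


def cost_direct(failed_attempts: int) -> int:
    """Total cost when every attempt, failed or successful, is a long clip."""
    return (failed_attempts + 1) * LONG_CLIP


def cost_with_short_test(failed_attempts: int) -> int:
    """Total cost when prompts are tried on short clips first.

    Each failed prompt and the final passing test cost one short clip,
    followed by a single long clip for the prompt that passed.
    """
    return (failed_attempts + 1) * SHORT_CLIP + LONG_CLIP


if __name__ == "__main__":
    for failures in range(4):
        direct = cost_direct(failures)
        tested = cost_with_short_test(failures)
        print(f"{failures} failed prompts: direct={direct}, test-first={tested}, "
              f"difference={direct - tested}")
```

Under these assumptions, testing first costs 50 Moments more when the prompt works immediately and becomes cheaper as soon as a single revision is needed.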
Save images before animating. Have a clear record of which source image you're working from. Once you identify an image that animates well, that combination of visual + prompt type is worth knowing for future sessions.
Premium model for source images. The 25-50 Moment cost difference between the standard and Premium image models pays off in video output quality. Spend the extra Moments on the source image.
Keep prompts to single continuous movements. The video system handles one action at a time better than sequences. "Walking forward while smiling" is fine — two simultaneous aspects of one continuous motion. "Walking, then stopping, then turning around" is a sequence that may not render as expected.
Budget Moments around your video goals first. If video generation is your primary reason for subscribing, plan your Moments allocation starting from the video budget and allocate remaining Moments to other features. Don't end up with 400 Moments at the end of the month, short of the ~600 a long clip needs, because you spent too much on images and voice.
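Video-first budgeting can be sketched as a small calculation: subtract the planned video spend from the monthly allocation, then see what the remainder covers. The costs below are the approximate figures quoted in this article, and the function name and return structure are illustrative assumptions.

```python
# Approximate costs quoted in this article, in Moments.
LONG_CLIP = 600
SHORT_CLIP = 50
IMAGE_COST = (25, 50)     # cheapest and most expensive image
VOICE_PER_MINUTE = 100


def plan_video_first(monthly_moments: int, long_clips: int, short_clips: int = 0) -> dict:
    """Reserve Moments for video first, then report what the remainder could buy.

    The image and voice figures are alternatives: each assumes the whole
    remainder is spent on that one feature.
    """
    video_spend = long_clips * LONG_CLIP + short_clips * SHORT_CLIP
    if video_spend > monthly_moments:
        raise ValueError("planned video spend exceeds the monthly allocation")
    remaining = monthly_moments - video_spend
    return {
        "video_spend": video_spend,
        "remaining": remaining,
        "images": (remaining // IMAGE_COST[1], remaining // IMAGE_COST[0]),
        "voice_minutes": remaining // VOICE_PER_MINUTE,
    }


if __name__ == "__main__":
    # Example: Premium allocation (8,000 Moments), 8 long clips plus 10 short tests.
    print(plan_video_first(8_000, long_clips=8, short_clips=10))
```

In the Premium example, video takes 5,300 Moments and leaves 2,700, enough for 54 to 108 images or 27 minutes of voice.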
For Moments purchase options and tier costs, see the full pricing breakdown.
Video vs Other Media Features: Cost-Benefit Perspective
Whether to prioritize video in your Moments budget depends on what you're using the platform for:
Video is worth the Moments if:
- You value visual companion media highly.
- You want the most distinctive feature in the AI companion space.
- You're on Premium or Ultimate, where the Moments allocation makes regular video sustainable.
- You're building a collection of companion clips as part of your engagement with the platform.
Video isn't the best Moments spend if:
- You're primarily a text conversation user and media is secondary.
- You're on Lite or Plus and want to maximize the number of interactions rather than the intensity of individual sessions.
- Your Moments budget is tight enough that one long video represents a disproportionate share of your monthly allocation.
For a side-by-side comparison of what each tier's Moments allocation enables across all features, see the free vs premium breakdown.
FAQ
How long are the videos?
Video length depends on your subscription tier and which clip length you select. Lite tier produces 3-second clips at the base ~50 Moments cost. Higher tiers (Plus, Premium, Ultimate) support longer clips with costs scaling up to ~600 Moments per long clip. Most users generate a mix of short clips to test prompts and longer clips for scenarios they want more developed. The platform uses AI-generated video synthesis, and generation takes approximately 2 minutes regardless of clip length.
Can free accounts generate video?
No. Video generation requires a Lite tier subscription or higher. Free accounts cannot generate video even if they have remaining Moments from the 200 one-time welcome credit: the feature is tier-gated, not just Moments-gated. The minimum tier for video access is Lite at $5.99/month. Even on Lite, the 1,000 monthly Moments allocation limits video production, so the tier is more suitable for users who want occasional short clips rather than regular video generation. Plus at $9.99/month provides better video economics with 3,000 Moments monthly.
How many videos can I generate per month?
The number depends on your tier and the clip lengths you choose. On Lite (1,000 Moments): approximately 1-20 clips depending on length. On Plus (3,000 Moments): approximately 5-60 clips. On Premium (8,000 Moments): approximately 13-160 clips. On Ultimate (15,000 Moments): approximately 25-300 clips. The wide ranges reflect the difference between generating all long clips (~600 Moments) and all short clips (~50 Moments), assuming the entire allocation goes to video. Realistically, most users generate a mix and fall somewhere in the middle of these ranges. You can extend your monthly video budget by purchasing Moments top-up bundles at any time.
How good is the video quality?
Video quality is rated 4.1/5 by aigirlfriendscout. For standard single-motion prompts, videos render with natural-looking movement, coherent facial expressions, and smooth transitions. The technology uses AI-based video synthesis derived from the Stable Diffusion lineage of deep learning models applied to motion. Outputs look impressively realistic for AI-generated content, especially motion involving natural body movement and facial expressions, but they are visibly AI-generated to a trained eye at close inspection. Quality is highest with simple, focused prompts, high-quality source images, and the Premium generation model.