O
Octo
O
Octo
CoursesPricingDashboardPrivacyTerms

© 2026 Octo

AI Tools Masterclass
1How to Use Claude2No-Code AI Tools3AI for Spreadsheets4AI Image & Video Tools5GitHub Copilot & Coding Assistants6AI for Presentations7AI Meeting & Productivity Tools8AI for Research & Analysis
Module 4

AI Image & Video Tools

From Midjourney to Runway, AI can now generate stunning images and videos from text. Here's how to use these tools for real work — and avoid the pitfalls.

The pitch deck that almost didn't happen

Marcus runs a small design agency. A startup founder calls at 3 PM on Thursday: "We're pitching investors tomorrow at 10 AM. We need 12 custom illustrations for our pitch deck — futuristic healthcare scenes, product mockups, team culture photos. Our budget is $500."

A year ago, Marcus would have said no. Custom illustrations take 2-3 days per piece. Stock photos would look generic. There was no way to deliver 12 unique visuals in 19 hours for $500.

Instead, Marcus opened Midjourney. By 9 PM he had 30 candidate images — futuristic hospital lobbies, doctors using AR headsets, abstract data visualizations. He refined 12 of them, composited a few in Canva, and delivered the deck at 8 AM. The founder closed a $2M seed round.

Marcus didn't replace his design skills. He used AI as a starting point — a way to go from blank canvas to first draft in minutes instead of hours.

15M+images generated daily across major AI platforms (estimated mid-2025 — exact figures vary; Midjourney alone reported millions of users)

60secaverage time to generate an AI image

10$Midjourney starting price per month

The AI image generation landscape

Four tools dominate AI image generation, each with distinct strengths.

ToolBest forStyleAccessCost
MidjourneyArtistic, stylized imageryPainterly, cinematic, polishedDiscord bot or web app$10-$60/mo
DALL-E 3Quick concept imagesClean, illustrative, literalBuilt into ChatGPTIncluded with ChatGPT Plus ($20/mo)
Stable DiffusionFull control, customizationAnything (open-source)Local install or web UIsFree (self-hosted) or $10/mo (DreamStudio)
Adobe FireflyCommercial-safe imagesProfessional, stock-photo feelAdobe Creative CloudIncluded with CC ($23/mo) or free tier

✗ Without AI

  • ✗Stunning artistic quality
  • ✗Requires Discord or separate web app
  • ✗Better for mood, atmosphere, style
  • ✗Strong with abstract and creative prompts
  • ✗More steps to get started

✓ With AI

  • ✓Built into ChatGPT — no separate tool
  • ✓Better at following literal instructions
  • ✓Stronger with text in images
  • ✓Easy to iterate in conversation
  • ✓Good enough for most business uses

Writing prompts that actually work

The gap between a mediocre AI image and a stunning one is almost always the prompt. Here's the formula that works across all tools.

The anatomy of a good image prompt:

[Subject] + [Style/Medium] + [Lighting] + [Composition] + [Mood] + [Details]
ComponentExampleWhy it matters
Subject"A woman reviewing data on a holographic display"What's in the image
Style"Digital illustration, clean vector style"Determines the visual feel
Lighting"Soft blue ambient lighting"Sets mood dramatically
Composition"Wide shot, rule of thirds"Controls framing
Mood"Professional, futuristic, optimistic"Emotional tone
Details"Minimal UI elements, glass desk, dark background"Specificity matters

Weak prompt: "A business meeting"

Strong prompt: "Professional team of four people collaborating around a glass conference table, modern office with floor-to-ceiling windows, natural daylight, candid photography style, diverse team, laptops and notebooks on the table, shallow depth of field, warm and productive mood"

⚡

Write an image prompt

25 XP
Pick ONE of these scenarios and write a detailed image prompt using the formula above (subject + style + lighting + composition + mood + details): 1. A hero image for a fintech startup's landing page 2. A blog post illustration about remote work productivity 3. A social media graphic about AI in education Your prompt should be at least 3 lines long with specific visual details. No vague descriptions like "nice" or "cool."

There Are No Dumb Questions

"Can I use AI-generated images commercially?"

It depends on the tool. Midjourney's paid plans grant commercial usage rights. DALL-E 3 via ChatGPT Plus allows commercial use. Adobe Firefly is specifically trained on licensed content to be commercially safe. Stable Diffusion depends on the model and license. Always check the specific tool's current terms of service before using images in commercial projects.

"Will AI replace graphic designers?"

No — but it's changing the job. Designers who use AI generate concepts faster, iterate more quickly, and spend more time on high-level creative direction. AI handles the first draft; humans handle taste, brand consistency, and final polish. The designers struggling are those who refuse to learn the tools.

AI for video generation

Video is where AI tools are evolving fastest. As of mid-2025, you can generate short clips from text or images.

ToolWhat it doesBest forCost
Runway Gen-3Text-to-video, image-to-video, video editingShort cinematic clips, product demos$12-$76/mo
PikaText-to-video with style controlSocial media clips, stylized animationsFree tier / $8/mo
Kling AIHigh-quality video generationRealistic motion, longer clipsFree tier / varies
Sora (OpenAI)Photorealistic video from text(Availability varies — check openai.com for current access)TBD
⚠️Video AI is early
AI video tools are impressive but still limited. Expect 4-10 second clips, occasional artifacts (extra fingers, morphing objects), and inconsistent motion. They're useful for concept videos, social media teasers, and prototyping — not for replacing a production video shoot. Quality improves monthly.

Practical video workflows

Social media teasers — Generate a 5-second loop from a product image for Instagram Reels

Concept visualization — Show a client what a final video might look like before hiring a production team

Background footage — Create abstract or atmospheric B-roll for presentations

Ad prototyping — Test 10 different visual concepts before committing budget to a real shoot

Image editing with AI

Beyond generating images from scratch, AI tools now handle editing tasks that used to require Photoshop expertise.

TaskToolHow it works
Remove backgroundsRemove.bg, Canva AIOne click — AI detects subject and removes background
Extend imagesDALL-E outpainting, Photoshop Generative FillAI generates content beyond the original frame
Remove objectsPhotoshop Generative Fill, Cleanup.picturesSelect an object, AI fills the space naturally
UpscaleTopaz Gigapixel, Magnific AIAI adds detail to low-resolution images
Style transferMidjourney /describe + /imagineUpload a reference image, generate new ones in that style

⚡

Plan a real visual project

50 XP
Pick a real project you're working on (or will work on) and plan an AI-assisted visual workflow: 1. **The project:** What visuals do you need? (e.g., pitch deck, social campaign, website redesign) 2. **Tool selection:** Which AI image tool would you use for each visual, and why? 3. **Prompt drafts:** Write 2 specific prompts for images you'd need 4. **Editing plan:** Which AI editing tools would you use for refinement? 5. **Human touch:** What would still need a human designer's input?

There Are No Dumb Questions

"How do I get consistent style across multiple AI images?"

Use a style reference in every prompt. In Midjourney, use the --sref flag or upload a reference image. In DALL-E 3, describe the style explicitly in each prompt ("digital illustration, flat design, muted earth tones, thin line art"). In Stable Diffusion, use a consistent model checkpoint and style LoRA. Consistency is the hardest part of AI image work.

"Are AI images always obvious?"

Not anymore. The best AI images are nearly indistinguishable from professional photography or illustration. But telltale signs remain: text in images is often garbled, hands can look wrong, and backgrounds may have impossible geometry. Always review carefully.

When NOT to use AI images

✗ Without AI

  • ✗Internal presentations and mockups
  • ✗Social media content that refreshes weekly
  • ✗Blog post illustrations
  • ✗Concept exploration and brainstorming
  • ✗Rapid prototyping

✓ With AI

  • ✓Brand photography that must feel authentic
  • ✓Legal or compliance-regulated materials
  • ✓Images where specific real people must appear
  • ✓Final production assets for major campaigns
  • ✓Situations where AI-generated content could erode trust

🔑The real workflow
The professionals getting the most from AI image tools don't generate final assets with AI. They use AI for the first 80% — concept, composition, mood — and then refine with traditional tools for the last 20%. The speed gain comes from going from idea to first draft in minutes instead of hours.

Key takeaways

  • Four major AI image tools: Midjourney (artistic), DALL-E 3 (convenient), Stable Diffusion (customizable), Adobe Firefly (commercially safe)
  • Prompt quality determines output quality — use the formula: subject + style + lighting + composition + mood + details
  • AI video generation is real but early — useful for short clips, prototyping, and social media, not for production video
  • AI image editing (background removal, outpainting, upscaling) is mature and production-ready
  • Always check commercial usage rights for your specific tool and plan
  • Best workflow: AI for the first 80%, human refinement for the last 20%

?

Knowledge Check

1.Which AI image tool is specifically designed to be commercially safe, trained on licensed content?

2.What is the most important factor in getting high-quality AI-generated images?

3.What is the current state of AI video generation tools like Runway and Pika?

4.What is the recommended professional workflow for using AI image tools?

Previous

AI for Spreadsheets

Next

GitHub Copilot & Coding Assistants