Hardcoded Text vs Soft Subtitles: Why It Matters
Before removing text from a video, it's worth understanding what kind of text you're dealing with because the technique varies completely. There are two fundamentally different categories:
- Soft subtitles (closed captions): Stored as a separate text track inside the video file (SRT, VTT, or embedded as subtitle streams in MKV/MP4 containers). These can be toggled on/off in any video player and removed cleanly using free tools like FFmpeg or HandBrake no AI needed. If your video has soft subtitles, you can disable them in seconds without touching the actual video pixels.
- Hardcoded text (burned in): Rendered directly into the video pixels at export time. Cannot be toggled off because it's part of the picture itself. This is what Wipe AI removes. Common examples: TikTok captions, YouTube auto generated subtitles that were rendered in, news ticker text, on screen titles, social media graphics, song lyric overlays, meme captions.
If you're not sure which type your video has, try playing it in VLC and toggling subtitles off (Subtitle → Subtitle Track → Disable). If the text disappears, it was soft. If it stays, it's hardcoded and that's where Wipe AI comes in.
Auto Detect Mode: How It Finds Text Without You Marking It
For videos with multiple text overlays (animated captions, scrolling lyrics, news tickers), manually drawing a box for each occurrence is tedious. Wipe AI's Auto Detect mode runs an OCR style scanner across every frame to identify where text exists, then automatically inpaints all detected regions in a single pass.
The detection works on:
- Latin alphabets (English, Spanish, French, German, Portuguese, Turkish, Italian)
- Cyrillic (Russian, Ukrainian, Bulgarian)
- CJK scripts (Chinese, Japanese, Korean)
- Arabic and Hebrew (right to left scripts)
- Numbers, dates, timestamps, and counters
Auto Detect is the right choice when text appears in many places throughout the video. For a single static title or watermark, manual selection is faster and more accurate.
Common Sources of Burned In Text
The most common situations where users need text removal:
- TikTok auto captions: TikTok's built in caption generator burns text into the video at upload. Once posted, the only way to remove the captions is AI inpainting.
- News and TV broadcasts: Lower thirds, scrolling tickers, station bugs, breaking news banners all hardcoded.
- Lyric videos: Animated lyrics rendered into the video pixels by the original creator.
- Educational content: Tutorial videos with on screen instructions, formulas, or step labels.
- Subtitled foreign films: Translated dialogue burned into the video, common on pirated or repackaged content where the original soft subtitle track was lost.
- Social media memes: Top/bottom text rendered on viral video clips.
- Date/time stamps: Camera burned timestamps from old camcorders, security camera footage, or dashcams.
When AI Text Removal Is Most Effective
AI inpainting works best in these conditions:
- Background changes over time: If the camera moves or the subject moves, the AI sees the area "without text" in nearby frames and uses that as reference. Static scenes with text on a still background are slightly harder.
- Text contrasts strongly with background: White text on a dark background, or vice versa, gives the AI a clean edge to work with. Text that blends into the background (similar colors) is more challenging.
- Higher resolution input: 1080p videos give the AI more pixel context than 480p. Always start from the highest quality version available.
- Text stays in roughly the same area: Captions that stay near the bottom of the screen are easier to clean than text that bounces around the entire frame.
What to Do When Text Sits Over a Face
Burned in text overlays often land on faces particularly TikTok captions positioned at face level. AI inpainting can reconstruct most facial features convincingly, but eyes, mouth, and fine skin texture sometimes show subtle artifacts. Two strategies:
- Crop instead of inpaint if the text is near the bottom or top edge.
- Re record if it's your own content and you can repeat the take with the caption positioned away from the face.
- Accept minor artifacts if neither option is feasible Wipe AI's output is still significantly better than blur or crop.
Related Tools
Different text removal needs have specialized workflows. See our dedicated guides for: subtitle removal, brand bug removal, and platform specific tools for TikTok Watermark Remover, CapCut Watermark Remover, and Kinemaster watermarks.


