Introduction

The SAM3 (Segment Anything Model) suite by Meta provides industry-leading computer vision tools for high-precision masking and asset isolation. These models allow creators to precisely segment specific elements from both static images and moving footage, significantly accelerating workflows for graphic design, compositing, and VFX.
SAM3 Image
SAM3 Image is a text-prompted image segmentation model designed to isolate specific elements within a static photograph.
Precision Masking: By uploading an image and describing target objects—such as a "dress" or "people"—the system generates precise masks for each identified element.
Multi-Instance Detection: If multiple instances of an object are present, the model can produce individual masks for every person or object detected.
Asset Isolation: This tool streamlines the process of isolating assets, making it easier to prepare elements for complex graphic design and digital compositing.
SAM3 Video
SAM3 Video is a text-prompted video segmentation model built to isolate and mask specific elements within a moving scene.
Temporal Consistency: After identifying a target element through text, the tool generates masks that track that element across the entire video sequence with high consistency.
Automated Rotoscoping: It effectively eliminates the need for time-consuming manual rotoscoping.
VFX Efficiency: Editors and VFX artists can use these masks to apply effects or remove backgrounds through complex motion significantly faster than traditional methods.
Technical Controls & Parameters
To achieve the most accurate masks, users have access to specific prompting tools and settings:
Image Prompt Boxes: In SAM3 Image, you can add specific bounding boxes to visually guide the model toward the exact objects you wish to segment.
Video Prompt Text: Describe your scene or subject directly to guide the video segmentation.
Prompt Tools: Utilize integrated tools to generate, complete, or translate prompts, or even upload a reference image to convert it into a prompt for the model.
Was this helpful?