Analyze images and videos with Gemini to get fast, reliable visual insights. Handle content from U…
This is the remote server; use the local version to analyze local images and videos.
Install the server in Claude Desktop:
npx -y @smithery/cli install @Artin0123/gemini-image-mcp-server --client claude
# Clone the repository
git clone https://github.com/Artin0123/gemini-vision-mcp.git
cd gemini-vision-mcp
# Install dependencies
npm install
# Compile TypeScript to dist/
npm run build
Create a Gemini API key in Google AI Studio and provide it to the server as GEMINI_API_KEY.
{
  "mcpServers": {
    "gemini-media": {
      "command": "node",
      "args": ["/absolute/path/to/gemini-vision-mcp/dist/index.js"],
      "env": {
        "GEMINI_API_KEY": "your_api_key_here",
        "GEMINI_MODEL": "models/gemini-flash-lite-latest"
      }
    }
  }
}
If no key is supplied, the server can still start (handy for automated scans), but any tool invocation will return a configuration error until a valid API key is configured.
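The start-without-a-key behavior described above can be pictured with a small sketch. This is not the server's actual code: `MediaServer`, `ToolResult`, and `callTool` are hypothetical names, and the error wording is invented for illustration. The only details taken from this README are the `GEMINI_API_KEY` variable and the rule that startup succeeds while tool calls fail until a key is present.

```typescript
// Hypothetical result shape, loosely modeled on an MCP tool result.
interface ToolResult {
  isError: boolean;
  text: string;
}

// Hypothetical server wrapper: the key is read at construction, checked at call time.
class MediaServer {
  private readonly apiKey: string | undefined;

  constructor(env: Record<string, string | undefined>) {
    // Construction never throws, so the server can start without a key.
    const key = env["GEMINI_API_KEY"]?.trim();
    this.apiKey = key ? key : undefined;
  }

  callTool(name: string): ToolResult {
    if (!this.apiKey) {
      // Every tool call reports a configuration error until a key is set.
      return {
        isError: true,
        text: `Configuration error: GEMINI_API_KEY is not set; cannot run ${name}.`,
      };
    }
    // A real server would call Gemini here; the sketch stops at the check.
    return { isError: false, text: `${name} would run now.` };
  }
}
```

Deferring the check this way lets tooling list the server's tools without credentials, which is presumably what the automated scans need.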
The server defaults to models/gemini-flash-lite-latest. Override it by either:
Setting the GEMINI_MODEL environment variable, or
Providing modelName in the Smithery/SDK configuration schema.
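A fallback chain like this might look as follows. This is a minimal sketch, not the project's implementation: `resolveModel` and `ServerConfig` are hypothetical names, and the precedence shown (configuration first, then environment, then default) is an assumption, since the README does not say which override wins when both are set.

```typescript
// Default model name as stated in this README.
const DEFAULT_MODEL = "models/gemini-flash-lite-latest";

// Hypothetical config shape; only the modelName field comes from the README.
interface ServerConfig {
  modelName?: string;
}

// Assumed precedence: explicit config, then GEMINI_MODEL, then the default.
function resolveModel(
  config: ServerConfig,
  env: Record<string, string | undefined>,
): string {
  const fromConfig = config.modelName?.trim();
  if (fromConfig) return fromConfig;

  const fromEnv = env["GEMINI_MODEL"]?.trim();
  if (fromEnv) return fromEnv;

  return DEFAULT_MODEL;
}
```

In a Node process the second argument would typically be `process.env`; passing it in explicitly keeps the function easy to test.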
analyze_image: Analyze one or more image URLs. Maximum file size: 16 MB per image.
analyze_youtube_video: Analyze a YouTube video from URL. No size limit.
Image URLs are downloaded and processed with a 16 MB size limit to ensure fast response times. Files exceeding this limit will result in an error message indicating the actual file size.
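The size rule for images can be sketched as a small check that runs after download. The names `checkImageSize` and `formatMegabytes` are hypothetical, the message wording is invented, and the sketch assumes 1 MB means 1024 × 1024 bytes, which the README does not specify.

```typescript
// Assumption: "16 MB" is interpreted as 16 * 1024 * 1024 bytes.
const MAX_IMAGE_BYTES = 16 * 1024 * 1024;

// Render a byte count in megabytes with two decimals for the error message.
function formatMegabytes(bytes: number): string {
  return (bytes / (1024 * 1024)).toFixed(2);
}

type SizeCheck = { ok: true } | { ok: false; message: string };

// Reject images over the limit and report the actual size.
function checkImageSize(byteLength: number): SizeCheck {
  if (byteLength > MAX_IMAGE_BYTES) {
    return {
      ok: false,
      message: `Image is ${formatMegabytes(byteLength)} MB, which exceeds the 16 MB limit.`,
    };
  }
  return { ok: true };
}
```

A caller would pass the length of the downloaded buffer and surface `message` as the tool's error text when `ok` is false.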
YouTube videos are processed directly by the Gemini API from their URL, without being downloaded by the server, so there is no size restriction.
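To illustrate why no download is needed, the sketch below builds a request body that hands the YouTube URL to Gemini as a file reference next to the text prompt. `buildYouTubeRequest` is a hypothetical helper, and the `file_data` / `file_uri` field names reflect the Gemini REST API's video-understanding request format as best understood here; treat them as an assumption and confirm against the current API reference. Nothing is sent over the network.

```typescript
// Assumed shape of a Gemini generateContent request part.
interface GeminiPart {
  text?: string;
  file_data?: { file_uri: string };
}

interface GeminiRequest {
  contents: Array<{ parts: GeminiPart[] }>;
}

// Build a request that references the video by URL instead of uploading bytes.
function buildYouTubeRequest(videoUrl: string, prompt: string): GeminiRequest {
  return {
    contents: [
      {
        parts: [
          { file_data: { file_uri: videoUrl } }, // the video, by reference
          { text: prompt },                      // what to do with it
        ],
      },
    ],
  };
}
```

Because only the URL travels in the request, the server never holds the video in memory, which is consistent with the absence of a size limit for this tool.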
Please analyze this product photo: https://teimg-bgr.pages.dev/file/mvYT6KeF.webp
Extract the main talking points from this clip: https://www.youtube.com/watch?v=dQw4w9WgXcQ
npm install
npm test
npm run build
The test suite exercises URL forwarding, MIME handling, and configuration fallbacks.
MIT