Tenyks

Video Understanding: Qwen2-VL, An Expert Vision-language Model

This article was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. Qwen2-VL, an advanced vision-language model built on Qwen2 [1], sets new benchmarks in image comprehension across varied resolutions and aspect ratios, while also tackling extended video content. Though Qwen2-VL excels on many fronts, this article explores the model’s […]

Scalable Video Search: Cascading Foundation Models

This article was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. Video has become the lingua franca of the digital age, but its ubiquity presents a unique challenge: how do we efficiently extract meaningful information from this ocean of visual data? In Part 1 of this series, we navigate […]

Zero-Shot AI: The End of Fine-tuning as We Know It?

This article was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. Models like SAM 2, LLaVA or ChatGPT can perform tasks without task-specific training. This has people wondering whether the old way of training AI (i.e., fine-tuning) is becoming outdated. In this article, we compare two models: YOLOv8 (fine-tuning) […]

SAM 2 + GPT-4o: Cascading Foundation Models via Visual Prompting (Part 2)

This article was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. In Part 2 of our Segment Anything Model 2 (SAM 2) series, we show how foundation models (e.g., GPT-4o, Claude 3.5 Sonnet and YOLO-World) can be used to generate visual inputs (e.g., bounding boxes) for SAM 2. Learn […]
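
To make the cascading idea concrete, here is a minimal sketch assuming the public sam2 package from the Segment Anything Model 2 repository: a first-stage model proposes bounding boxes, and SAM 2 consumes them as box prompts. The detect_boxes helper is hypothetical (a stand-in for YOLO-World, GPT-4o, or any other box-producing model), and the config and checkpoint names are assumptions, not part of the original article.

```python
# Sketch of a two-stage cascade: detector boxes -> SAM 2 masks.
import numpy as np
from PIL import Image
from sam2.build_sam import build_sam2
from sam2.sam2_image_predictor import SAM2ImagePredictor

image = np.array(Image.open("frame.jpg").convert("RGB"))

# Stage 1: any model that returns [x0, y0, x1, y1] boxes.
# `detect_boxes` is a hypothetical placeholder for YOLO-World, GPT-4o, etc.
boxes = detect_boxes(image)

# Stage 2: SAM 2 turns each box prompt into a segmentation mask.
# Config/checkpoint names are assumptions; use the files you actually downloaded.
predictor = SAM2ImagePredictor(build_sam2("sam2_hiera_l.yaml", "sam2_hiera_large.pt"))
predictor.set_image(image)
masks, scores, _ = predictor.predict(box=np.array(boxes), multimask_output=False)
```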

SAM 2 + GPT-4o: Cascading Foundation Models via Visual Prompting (Part 1)

This article was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. In Part 1 of this article, we introduce Segment Anything Model 2 (SAM 2). Then, we walk you through how you can set it up and run inference on your own video clips. Learn more about visual prompting […]
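
As a preview of the setup described above, below is a minimal sketch of running SAM 2 on a clip, assuming the sam2 package and its video predictor; the config/checkpoint names, frame directory, and example point prompt are assumptions, and the method names follow the repository’s video example at the time of writing.

```python
# Sketch: prompt SAM 2 on one frame, then propagate masks through the clip.
import torch
from sam2.build_sam import build_sam2_video_predictor

# Config and checkpoint names are assumptions; point them at your downloads.
predictor = build_sam2_video_predictor("sam2_hiera_l.yaml", "sam2_hiera_large.pt")

with torch.inference_mode():
    # The clip is expected as a directory of JPEG frames.
    state = predictor.init_state(video_path="./my_clip_frames")

    # One positive click on frame 0 identifies the object to track (coords are made up).
    predictor.add_new_points(
        inference_state=state,
        frame_idx=0,
        obj_id=1,
        points=[[420.0, 260.0]],
        labels=[1],  # 1 = foreground click
    )

    # Propagate the prompt through the rest of the video.
    for frame_idx, obj_ids, mask_logits in predictor.propagate_in_video(state):
        masks = (mask_logits > 0.0).cpu().numpy()  # boolean masks per tracked object
```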

RAG for Vision: Building Multimodal Computer Vision Systems

This blog post was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. This article explores the exciting world of Visual RAG, covering its significance and how it’s revolutionizing traditional computer vision pipelines. From understanding the basics of RAG to its specific applications in visual tasks and surveillance, we’ll examine […]

Multimodal Large Language Models: Transforming Computer Vision

This blog post was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. This article introduces multimodal large language models (MLLMs) [1], their applications using challenging prompts, and the top models reshaping computer vision as we speak. What is a multimodal large language model (MLLM)? In layman’s terms, a multimodal […]

DALL-E vs Gemini vs Stability: GenAI Evaluations

This article was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. We performed a side-by-side comparison of three models from leading providers in Generative AI for Vision. This is what we found: despite the subjectivity involved, human evaluation is still the best approach for evaluating state-of-the-art GenAI Vision […]

Improving Vision Model Performance Using Roboflow and Tenyks

This blog post was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. When improving an object detection model, many engineers focus solely on tweaking the model architecture and hyperparameters. However, the root cause of mediocre performance often lies in the data itself. In this collaborative post between Roboflow and […]

NVIDIA TAO Toolkit: How to Build a Data-centric Pipeline to Improve Model Performance (Part 3 of 3)

This blog post was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. During this series, we will use Tenyks to build a data-centric pipeline to debug and fix a model trained with the NVIDIA TAO Toolkit. Part 1. We demystify the NVIDIA ecosystem and define a data-centric pipeline based […]
