Advice on Identifying Extremely Similar Objects in Real Time

-4

Closed. This question needs debugging details. It is not currently accepting answers.

Edit the question to include desired behavior, a specific problem or error, and the shortest code necessary to reproduce the problem. This will help others answer the question.

Closed 10 days ago.

Improve this question

I’m working on a real-time system that identifies objects, but I’m facing a challenge: new objects can look extremely similar to known ones (sometimes differences are as small as 0.1 mm), and I need the system to detect when an object is truly new. Objects may appear in different positions and rotations, and I need to handle multiple objects in the same image.

Currently, I’m using YOLOv8, which works well for detecting and identifying pieces. However, since YOLO is primarily for detection and localization, I’m considering using a model like ResNet or VGG16 to extract visual features after YOLO detects each piece.

I’d love advice on whether this approach is considered good practice, or if there are better architectures or strategies for handling very similar objects while detecting unknown ones reliably.

edited Nov 15 at 1:03

desertnaut

60.8k32 gold badges155 silver badges183 bronze badges

asked Nov 7 at 22:52

nour achahlaou

12 bronze badges

2

Please provide enough code so others can better understand or reproduce the problem.

Toni
– Toni

2025-11-08 09:28:14 +00:00
Commented Nov 8 at 9:28
Hi Toni, I understand your point — unfortunately, I can’t share the full code since it’s quite complex and part of a larger industrial system. However, I can explain the core logic behind it. I’m working with very small aircraft parts that can look extremely similar (sometimes the differences are below 0.1 mm and barely visible). I use YOLOv8 for detection and localization, and each image comes from a 5400×3600 camera. Because of GPU constraints, I had to resize and crop the images carefully to keep as much detail as possible.

nour achahlaou
– nour achahlaou

2025-11-12 21:51:36 +00:00
Commented Nov 12 at 21:51
The dataset is also dynamically generated — images are captured and annotated automatically through code, so the full implementation is tightly coupled with the capture and annotation pipeline. That’s why I can’t easily isolate a “minimal reproducible example.” My main question is more architectural: whether using YOLO for detection and then a feature extractor (like ResNet or VGG16) for visual similarity and novelty detection is a good strategy for distinguishing between nearly identical parts and detecting unknown ones.

nour achahlaou
– nour achahlaou

2025-11-12 21:51:42 +00:00
Commented Nov 12 at 21:51
please take it to Cross Validated

Christoph Rackwitz
– Christoph Rackwitz

2025-11-12 22:55:23 +00:00
Commented Nov 12 at 22:55

Add a comment |

0

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Collectives™ on Stack Overflow

Advice on Identifying Extremely Similar Objects in Real Time [closed]

0

Hot Network Questions