Labeling Blob Analysis
Some of our shape recognition technology is based on the feature analysis (blob analysis) of labeled regions (primitives) in images.
For monochrome images, we can perform labeling of all shapes and calculation of basic attributes (area, size, etc.) of regions (primitives) within 1 second, even for complex topographic maps larger than A0 or A1 double-page newspapers.
In our case, we refer to labeled regions as "primitives". Kanji characters and symbols are divided into one or more primitives and analyzed. English uppercase letters and numbers are treated as one character = one primitive.

Primitives

For monochrome images, there are two types of labeling: 4-connected labeling, which considers regions (primitives) as one when there is a pixel connection in the four directions (up, down, left, right), and 8-connected labeling, which also allows diagonal connections.

4-connected and 8-connected

For grayscale and color images:
  1. Labeling based on color reduction processing that statistically minimizes the error with the original image
  2. Labeling based on simple distance in brightness or color space
  3. Labeling based on colorization using optimization
are used to divide regions. In particular, the third method has a high computational cost, but it has advantages such as:
  • Ability to achieve optimal region division based on the definition of distance between pixels used in the calculations
  • Ease of parallelization for accelerated computation
  • Ability to be directly applied to videos
Due to these characteristics, its usage frequency is gradually increasing.
The basic attributes that can be calculated using our library are as follows:
  1. Bounding rectangle
  2. Exists between (xmin,ymin) and (xmax,ymax).
    Horizontal Feret diameter = xmax-xmin Vertical Feret diameter = ymax-ymin
  3. Area
  4. Perimeter
  5. Circularity
  6. circleness
  7. Line width
  8. Absolute maximum length, BR'DTH, ORIENT
Labeling is used in the following ways:
  1. Filling regions with background color as noise for regions below a certain area
  2. For example, in the case of monochrome, both black and white regions can be labeled, so isolated points or holes can be removed as noise.
  3. Estimating line width from line drawings
  4. Estimating the boundaries of strings or paragraphs based on distances and positional relationships between primitives