Automatic Segmentation Tool for Long Webpage Screenshots
1 Background
When needing to share or analyze web content, long screenshots are a very practical form as they can fully display the page. However, processing these long screenshots while maintaining their information integrity and readability, and facilitating subsequent operations, has always been a challenge. For example, as of early 2024, mainstream AI image models on the market still cannot handle very large and complex images. If a long screenshot is forcibly input into the model, it will result in degraded performance (many details cannot be recognized). To solve this problem, I developed a tool based on OpenCV, aimed at simplifying the processing of long screenshots while maintaining their content integrity and readability.
This project is open source on my Github: https://github.com/Ryaang/Web-page-Screenshot-Segmentation
Unlike many existing tools or methods, Web-page-Screenshot-Segmentation uses OpenCV to automatically identify and follow the natural dividing lines of web content, automatically finding the most suitable segmentation points. This means that whether it is titles, paragraphs, or charts, they can be neatly retained in the segmented images without content breakage or omission.
Using Web-page-Screenshot-Segmentation is very simple. You only need to prepare a long screenshot, and the tool will automatically analyze the image content and intelligently decide the segmentation points. The result will be a series of complete and well-structured images, convenient for sharing and further processing.
2 Introduction
This project is used to segment long screenshots of web pages into several parts based on the height of the text. The main idea is to find the low-variation areas of the image and then find the segmentation lines in these areas.
The output is small but complete images of the webpage, which can be used to generate webpages or train models using Screen-to-code. More results can be found in the images directory.
3 Getting Started
3.1 Installation
|
|
4 Using in Command Line
Get the height of the segmentation lines of the image
|
|
The output should be a list: [6, 868, 1912, 2672, 3568, 4444, 5124, 6036, 7698]
. It is a list of the heights of the image segmentation lines.
If you want to display this segmentation line in the image, you can add the -s True
parameter:
|
|
4.1 Draw Segmentation Lines in the Image
|
|
4.2 Split Image
|
|
You will get the segmented images saved in the path returned by the command.
For more usage explanations, please refer to the help:
|
|
5 Using from Source Code
5.1 split_heights Function
The split_heights
function is used to segment the image into several parts based on various thresholds. It accepts the following parameters:
file_path
: The path of the image file.split
: A boolean indicating whether to split the image.height_threshold
: The height threshold of the low-variation area.variation_threshold
: The variation threshold of the low-variation area.color_threshold
: The color difference threshold.color_variation_threshold
: The color difference variation threshold.merge_threshold
: The minimum distance threshold between two lines.
If split
is False
, the function returns a list of the heights of the segmentation lines; if split
is True
, it returns the path of the segmented images.
5.1.1 Example Usage
|
|
In this example, the image at ‘path/to/image.jpg’ is segmented into several parts based on the provided thresholds. The segmented images are saved in the path returned by the function.
5.2 draw_line_from_file Function
The draw_line_from_file
function is used to draw lines on the image at specified heights. It accepts the following parameters:
image_file
: The path of the image file.heights
: A list of heights at which to draw lines.color
: The color of the lines. The default color is red(0, 0, 255)
.
The function reads the image from the provided file path, draws lines at the specified heights, and then saves the modified image to a new file. The new file is saved in the result
directory, with the same name as the original file but with ‘result’ added before the file extension.
If the function encounters an error reading the image file (e.g., if the file path contains ‘.’ or Chinese characters), it will throw an exception.
5.2.1 Example Usage
|
|
In this example, the image at ‘path/to/image.jpg’ is modified to draw green lines at heights 100 and 200. The modified image is saved in the path returned by the function.
data:image/s3,"s3://crabby-images/2c1ca/2c1ca16c01082fb2a11697efb7173e86ae4792ff" alt="Tim Alipay"
data:image/s3,"s3://crabby-images/9226d/9226da7a8f9364df6f09972e89d662dc7d8236f8" alt="Tim PayPal"
data:image/s3,"s3://crabby-images/d311e/d311e96bee586e5a732e6f5b1afbb16403be5637" alt="Tim WeChat Pay"