Prediction Pipeline In Cloud - Models

Apart from applying the prediction models to analyze images, the script that runs the prediction pipeline on Google Cloud ingests data into BigQuery to store all the image analysis details.

After setting up a server in a Google Cloud account, you can start recording videos of objects passing on your conveyor belt to gather data for analysis. The next step is transferring those video or image files to a Cloud Storage bucket, where the prediction pipeline processes the images.

The results of each video or image prediction are stored and appended to two BigQuery tables, ensuring efficient data management. After systematically processing each file in the Cloud Storage bucket, the pipeline exports and stores the results in another bucket for further use and analysis.

This page explains how to create and manage Cloud Storage buckets on Google Cloud for the videos you record, run the prediction pipeline to apply the models, and process images for further analysis. You can then connect a visualization dashboard to the BigQuery tables to display results as charts and reports.

Store videos in Cloud Storage buckets

To effectively manage and process the videos or images recorded by the machine vision camera capturing objects on the conveyor belt, you need the following storage devices:

Local disk storage: Temporarily cache or store files locally from the camera.
Cloud Storage input bucket: Store your recorded videos or images. You can automate uploads using a storage transfer agent.
Cloud Storage output bucket: Store the prediction results.

By using these storage options, you ensure an organized, automated, and efficient workflow for managing and processing your recorded videos or images and prediction results.

Create the Cloud Storage input and output buckets

Create a Cloud Storage bucket in your Google Cloud account that serves as the input repository for the recorded videos. This bucket stores all the videos or images captured by the machine vision camera and centralizes the storage of your recorded files, making them easily accessible for further processing.

Before uploading to the input bucket, store the files temporarily on your local disk in a predetermined location. This temporary storage ensures you have a local backup and can manage uploads efficiently.

To streamline the upload process to the input bucket, you can schedule a storage job to run at your expected frequency. This automated job takes care of transferring videos or images from the local disk to the Cloud Storage input bucket, relieving you of the manual task. Additionally, implementing a storage transfer agent further facilitates the transfer process, ensuring reliable and efficient uploads. For more information, see Manage transfer agents. Otherwise, manually upload your files stored or cached in your local storage to the input bucket before running the prediction pipeline. For more information about manual uploads, see Upload objects from a file system.

In addition to the input bucket, create another Cloud Storage output bucket to store the results generated by your prediction pipeline. This output bucket serves as a well-organized repository for the processed data, such as predictions and analysis results, ensuring that the output from your models is easily accessible and you can manage your pipeline's output.

Grant required permissions

The service account from your NVIDIA T4 GPU virtual machine (VM) instance must have access to Cloud Storage buckets and BigQuery tables to let the VM instance run the prediction pipeline and upload the results to BigQuery.

The VM you created when deploying CircularNet has a service account that must act as a principal with the following two roles:

Grant the Storage Admin role to access your Cloud Storage input and output buckets. Add the VM service account as a principal to a bucket-level policy for both buckets.
Grant the BigQuery Admin role to access your BigQuery tables. Manage roles of the VM service account using the Google Cloud console.

Run the prediction pipeline

Follow these steps to run the prediction pipeline and process your input files on Google Cloud:

Open the Google Cloud console.
Create the Cloud Storage input and output buckets.
Upload the images or videos your machine vision camera captured to the Cloud Storage input bucket.
Grant the required permissions to the VM service account.
From the Navigation menu on the Google Cloud console, select Compute Engine > VM instances.
On the VM instances page, find the VM instance you created with the NVIDIA T4 GPU when deploying CircularNet.
If you stopped your VM instance, restart it by clicking More actions > Start / Resume in the row of the instance that you want to restart.

Note: Stop the instance after you finish using it to avoid unnecessary expenses.
Click SSH in the row of the instance that you want to connect to. The SSH-in-Browser tool opens. For more information, see Connect to VMs.
On the SSH-in-browser window, start the server.
Display the names of the models you loaded to the Triton inference server:
```
cat triton_inference_server.sh
```
The first lines of the output show the names of the loaded models in square brackets. You can call any of these models when running the prediction pipeline.

Important: Run the previous command in the server folder, which contains the triton_inference_server.sh script.
Exit the server folder and open the client folder in the prediction_pipeline directory:
```
cd ..
cd client/
```
This folder contains the pipeline_images.py and pipeline_videos.py Python files that store the complete prediction pipelines for input images or videos, respectively. The run_gcp_images.sh and run_gcp_videos.sh scripts run these Python files automatically.
If you have to modify the scripts to provide your specific paths and values for the prediction pipeline, edit the corresponding parameter values on the script. The following example modifies the image pipeline script:
```
vim run_images.sh
```
The Vim editor displays the following parameters:
```
--input_directory=<path-to-input-bucket>
--output_directory=<path-to-output-bucket>
--height=<height>
--width=<width>
--model=<circularnet-model>
--score=<score>
--search_range_x=<search-range>
--search_range_y=<search-range>
--memory=<memory>
--project_id=<project-id>
--bq_dataset_id=<bigquery-dataset-id>
--bq_table_id=<bigquery-table-id>
--overwrite=<overwrtie_table>
--tracking_visualization=<visualize-tracking-results>
--cropped_objects=<crop-objects-per-category>
```
Replace the following:
- <path-to-input-bucket>: The path to the Cloud Storage input bucket you created, for example gs://my-input-bucket/.
- <path-to-output-bucket>: The path to the Cloud Storage output bucket you created, for example gs://my-output-bucket/.
- <frames-per-second>: The rate at which you want to capture images from videos to split videos into frames, for example, 15.
- <height>: The height in pixels of the image or video frames that the model expects for prediction, for example, 512.
- <width>: The width in pixels of the image or video frames that the model expects for prediction, for example, 1024.
- <circularnet-model>: The name of the CircularNet model in the Triton inference server that you want to call, for example, Jan2025_ver2_merged_1024_1024.
- <score>: The threshold for model prediction, for example, 0.70.
- <search-range>: The pixels up to which you want to track an object for object tracking in consecutive frames, for example, 100.
- <memory>: The frames up to which you want to track an object, for example, 20.
- <project-id>: The ID of your Google Cloud project, for example, my-project.
- <bigquery-dataset-id>: The ID that you want to assign to a BigQuery
  dataset to store prediction results, for example, circularnet_dataset.
- <bigquery-table-id>: The ID that you want to assign to a BigQuery
  table to store prediction results, for example, circularnet_table.
  If the table already exists in your Google Cloud project, the pipeline
  appends results to that table.
- <overwrtie_table> : If set to True, overwrites the pre-existing
  BigQuery table.
- <visualize-tracking-results>: If set to True, visualizes the
  tracking results from the tracking algorithm.
- <crop-objects-per-category>: If set to True, crops the objects per
  category according to the prediction and tracking results.
Note: If your input files are not videos but images, replace run_gcp_videos.sh on the command with run_gcp_images.sh and remove the --fps=<frames-per-second> parameter.

Save changes and exit the Vim editor. To do this, press the Esc key, type :wq, and then press Enter.
Enter the screen session for the client:
```
screen -R inference
```
Run the prediction pipeline:
```
bash run_images.sh
```
Note: If you have a large amount of input files, you can run the pipeline in a screen session in the background without worrying about the terminal closing down. First, you launch the screen session with the screen -R client command. A new session shell launches. Then, run the bash run_images.sh script in the new shell.

The script also creates a logs folder inside the client folder that saves the logs with the troubleshooting results and records from the models.

You have finished running the prediction pipeline and applying the prediction models to your files for further analysis. You can find the image results with the applied masks in your output bucket. You can also open the generated BigQuery table to see the model analytics results and preview table data. To create new results, repeat the steps in this section every time you modify the files in your input bucket.

Important: If you rerun the prediction pipeline on the same video or image file, you must delete the results created the first time you ran the script from the output bucket to avoid conflicting issues. Manage the lifecycle of objects in your Cloud Storage buckets to help manage costs.