dgs.models.dataset.keypoint_rcnn.KeypointRCNNVideoBackbone¶

class dgs.models.dataset.keypoint_rcnn.KeypointRCNNVideoBackbone(*args: Any, **kwargs: Any)[source]¶

A Dataset that gets the path to a single Video file and predicts the bounding boxes and key points of the Video.

Predicts 17 key-points (like COCO).

References

https://pytorch.org/vision/0.17/models/generated/torchvision.models.detection.keypointrcnn_resnet50_fpn.html

Metaclass for the torchvision Key Point RCNN backbone model.

This class sets up the RCNN model and validates and sets the basic modules parameters.

Params:

data_path (FilePath) – A single path or a list of paths. The path is either a directory, a single image file, or a list of image filepaths.

Optional Params:

score_threshold (float, optional) – Detections with a score lower than the threshold will be ignored. Default DEF_VAL.dataset.kprcnn.score_threshold.
iou_threshold (float, optional) – Bounding-boxes with IoU above this threshold will be ignored. Default DEF_VAL.dataset.kprcnn.iou_threshold.
force_reshape (bool, optional) – Whether to force reshape all the input images. Change the size and mode via image_mode and image_size parameters, iff force_reshape is True. Default DEF_VAL.images.force_reshape.
image_size (:obj:`ImgSize`, optional) – The size, the loaded image should have, iff force_reshape is True. Default DEF_VAL.images.image_size.
image_mode (str, optional) – The mode to use when loading the image, iff force_reshape is True. Default DEF_VAL.images.image_mode.
crop_size (:obj:`ImgSize`, optional) – The size, the image crop should have. Default DEF_VAL.images.crop_size.
crop_mode (str, optional) – The mode to use when cropping the image. Default DEF_VAL.images.crop_mode.
bbox_min_size (float, optional) – The minimum side length a bounding box should have in pixels. Smaller detections will be discarded. Works in addition to the threshold parameter. If you do not want to discard smaller bounding boxes, make sure to set bbox_min_size to 1.0. The size of the bounding boxes is in relation to the original image. Default DEF_VAL.images.bbox_min_size.
weights (KeypointRCNN_ResNet50_FPN_Weights, optional) – The weights to load for the model. Default KeypointRCNN_ResNet50_FPN_Weights.COCO_V1.

Methods

__init__(config: dict[str, any], path: list[str]) → None[source]¶

arbitrary_to_ds(a: torchvision.tv_tensors.Image | torch.Tensor, idx: int) → list[State][source]¶: Given a frame of a video, return the resulting state after running the RCNN model.

configure_torch_module(module: torch.nn.Module, train: bool | None = None) → torch.nn.Module¶

Set compute mode and send model to the device or multiple parallel devices if applicable.

Parameters:

module – The torch module instance to configure.
train – Whether to train or eval this module, defaults to the value set in the base config.

Returns:

The module on the specified device or in parallel.

get_image_crops(ds: State) → None¶

Add the image crops and local key-points to a given state. Works for single or batched State objects. This function modifies the given State in place.

Will load precomputed image crops by setting self.params["crops_folder"].

get_path_in_dataset(path: str) → str¶

Given an arbitrary file- or directory-path, return its absolute path.

check whether the path is a valid absolute path
check whether the path is a valid project path
check whether the path is an existing path within self.params[“dataset_path”]

Returns:: The absolute found path to the file or directory.
Raises:: FileNotFoundError – If the path is not found.

images_to_states(images: list[torchvision.tv_tensors.Image | torch.Tensor]) → list[State]¶

Given a list of images, use the key-point-RCNN model to predict key points and bounding boxes, then create a State containing the available information.

Notes

Does not add the original image to the new State, to reduce memory / GPU usage. With the filepath given in the state, the image can be reloaded if required.

terminate() → None¶

Terminate this module and all of its submodules.

If nothing has to be done, just pass. Is used for terminating parallel execution and threads in specific models.

static transform_crop_resize() → torchvision.transforms.v2.Compose¶

Given one single image, with its corresponding bounding boxes and key-points, obtain a cropped image for every bounding box with localized key-points.

This transform expects a custom structured input as a dict.

>>> structured_input: dict[str, any] = {
    "image": tv_tensors.Image,
    "box": tv_tensors.BoundingBoxes,
    "keypoints": torch.Tensor,
    "output_size": ImgShape,
    "mode": str,
}

Returns:

A composed torchvision function that accepts a dict as input.

After calling this transform function, some values will have different shapes:

image: Now contains the image crops as tensor of shape [N x C x H x W].
bboxes: Zero, one, or multiple bounding boxes for this image as tensor of shape [N x 4]. And the bounding boxes got transformed into the XYWH format.
coordinates: Now contains the joint coordinates of every detection in local coordinates in shape [N x J x 2|3].

static transform_resize_image() → torchvision.transforms.v2.Compose¶

Given an image, bboxes, and key-points, resize them with custom modes.

This transform expects a custom structured input as a dict.

>>> structured_input: dict[str, any] = {
    "image": tv_tensors.Image,
    "box": tv_tensors.BoundingBoxes,
    "keypoints": torch.Tensor,
    "output_size": ImgShape,
    "mode": str,
}

Returns:: A composed torchvision function that accepts a dict as input.

validate_params(validations: dict[str, list[str | type | tuple[str, any] | Callable[[any, any], bool]]], attrib_name: str = 'params') → None¶

Given per key validations, validate this module’s parameters.

Throws exceptions on invalid or nonexistent params.

Parameters:

attrib_name – name of the attribute to validate, should be “params” and only for base class “config”
validations –
Dictionary with the name of the parameter as key and a list of validations as value. Every validation in this list has to be true for the validation to be successful.
The value for the validation can have multiple types:
- A lambda function or other type of callable
- A string as reference to a predefined validation function with one argument
- None for existence
- A tuple with a string as reference to a predefined validation function with one additional argument
- It is possible to write nested validations, but then every nested validation has to be a tuple, or a tuple of tuples. For convenience, there are implementations for “any”, “all”, “not”, “eq”, “neq”, and “xor”. Those can have data which is a tuple containing other tuples or validations, or a single validation.
- Lists and other iterables can be validated using “forall” running the given validations for every item in the input. A single validation or a tuple of (nested) validations is accepted as data.

Example

This example is an excerpt of the validation for the BaseModule-configuration.

>>> validations = {
    "device": [
            str,
            ("any",
                [
                    ("in", ["cuda", "cpu"]),
                    ("instance", torch.device)
                ]
            )
        ],
        "print_prio": [("in", PRINT_PRIORITY)],
        "callable": (lambda value: value == 1),
    }

And within the class __init__() call:

>>> self.validate_params()

Raises:

InvalidParameterException – If one of the parameters is invalid.
ValidationException – If the validation list is invalid or contains an unknown validation.

Attributes

`device`	Get the device of this module.
`is_training`	Get whether this module is set to training-mode.
`module_name`	Get the name of the module.
`module_type`
`name`	Get the name of the module.
`name_safe`	Get the escaped name of the module usable in filepaths by replacing spaces and underscores.
`precision`	Get the (floating point) precision used in multiple parts of this module.
`data`	Arbitrary data, which will be converted using `self.arbitrary_to_ds()`
`model`
`dataset_path`	The base path to the dataset.

dgs.models.dataset.keypoint_rcnn.KeypointRCNNVideoBackbone¶

Table of Contents

Previous topic

Next topic

This Page