
Add Cyclic Group Representation (CGR) class #182


Open · wants to merge 12 commits into main

Conversation

@Zeldax64 (Contributor) commented May 28, 2025

Introduce a new VSA class. Solves #181.

Description

CGR is very similar to MCR, but differs in bundling. This pull request introduces:

  • A new BaseMCRTensor parent class, shared by MCRTensor and CGRTensor.
  • CGRTensor, which uses a different bundling operation than MCRTensor (a toy sketch of the difference follows this list).
  • Tests for CGR. Since CGR diverges from MCR only in bundling, just a custom bundling test is required.
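
For intuition, here is a toy sketch of how the two bundling styles can differ. The concrete operations below are illustrative assumptions, not torchhd's actual implementations: binding as elementwise modular addition, an MCR-style bundling via phasor summation, and a CGR-style bundling via an elementwise majority vote.

    import torch

    # Toy illustration only; the operations below are assumptions,
    # not torchhd's actual MCR/CGR code.
    # Elements of MCR/CGR hypervectors are integers in [0, block_size).
    block_size = 4
    a = torch.tensor([0, 1, 2, 3, 1])
    b = torch.tensor([1, 1, 3, 3, 2])

    # Binding (shared by both models): elementwise addition modulo block_size.
    bound = (a + b) % block_size

    # Assumed MCR-style bundling: map elements to unit phasors on the complex
    # circle, sum them, and quantize the resulting angle back to an integer.
    angles = 2 * torch.pi * torch.stack((a, b)).float() / block_size
    phasor_sum = torch.exp(1j * angles).sum(dim=0)
    step = 2 * torch.pi / block_size
    mcr_bundle = torch.round(torch.angle(phasor_sum) / step).long() % block_size

    # Assumed CGR-style bundling: elementwise majority vote (mode).
    cgr_bundle = torch.stack((a, b)).mode(dim=0).values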

Additionally, this pull request fixes a bug in MCRTensor's __torch_function__() implementation. This function parses args to find MCRTensors and ensures they all share the same block_size:

    block_sizes = set(a.block_size for a in args if hasattr(a, "block_size"))
    if len(block_sizes) != 1:
        raise RuntimeError(
            f"Call to {func} must contain exactly one block size, got {list(block_sizes)}"
        )

However, this parsing treats args as a flat 1D container, so the following snippet breaks:

    import torch
    from torchhd import embeddings

    id = embeddings.Random(4, 10, vsa='MCR', block_size=4).weight
    t = torch.stack((id[0], id[1]))

In this minimal example, torch.stack() calls __torch_function__ with a nested tuple (the stacked tensors are wrapped in an inner tuple), which the flat parsing cannot handle.
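
One way to handle this is to walk nested containers recursively. The sketch below is not necessarily the exact code merged in this PR (the helper name _collect_block_sizes is hypothetical); it assumes args and func are the arguments received by __torch_function__(), and raises ValueError since a later commit in this PR switches to it:

    def _collect_block_sizes(args):
        # Recursively gather block_size from tensors, including those nested
        # inside tuples/lists such as the one torch.stack() passes.
        sizes = set()
        for a in args:
            if isinstance(a, (tuple, list)):
                sizes |= _collect_block_sizes(a)
            elif hasattr(a, "block_size"):
                sizes.add(a.block_size)
        return sizes

    block_sizes = _collect_block_sizes(args)
    if len(block_sizes) != 1:
        raise ValueError(
            f"Call to {func} must contain exactly one block size, got {list(block_sizes)}"
        )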

Checklist

  • I added/updated documentation for the changes.
  • I have thoroughly tested the changes.

The only remaining problem is in the documentation. Due to the addition of BaseMCRTensor, the documentation of MCR and CGR is split, requiring users to read two manual pages. Also, I wasn't able to add the new CGRTensor and BaseMCRTensor classes to the proper table of contents.

Zeldax64 added 6 commits May 27, 2025 10:11

  • Add a new base class to support future implementations of variations of MCR proposals.
  • Fix a bug when args is a collection of collections instead of a plain tuple: the old args parsing was unable to find block_size in nested structures containing BaseMCRTensor. For instance, let `a` and `b` be BaseMCRTensor variables; calling `torch.stack((a, b))` results in an error, as the `args` received in `__torch_function__()` is a nested tuple.
  • Add a new VSA class named Cyclic Group Representation (CGR). This class is similar to MCR, but differs in bundling.
  • Allow its usage in level and circular embeddings, as done with MCRTensor.
  • Ensure both inputs have the same shape.
  • Implement a custom bundling test: CGR should behave almost the same as MCR, but diverges in bundling.
@mikeheddes (Member) left a comment


Thank you for opening this PR, this would be a great addition to the library. Your implementation looks good, I just added some minor comments. Good catch on the __torch_function__ bug, btw.

@mikeheddes (Member)

It would also be good to add a link to the documentation page of the Cyclic Group Representation to the README. Note that the link will not work now but will work as soon as this PR gets merged (assuming you add the documentation page).

@mikeheddes (Member)

I can think of two approaches to keeping the documentation for each VSATensor on one page. One is to not have the BaseMCRTensor class and simply duplicate the implementation for both the MCRTensor and the CGRTensor. The second is to implement all the methods that need documentation on both child classes simply by calling super, like so:

    def multibind(self) -> "MCRTensor":
        """Bind multiple hypervectors"""
        return super().multibind()

This will add the method and docstring to the documentation page. Then all you have to do to include the CGRTensor in the documentation is to add it to docs/torchhd.rst. I would probably go for the second option.
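
For reference, and assuming docs/torchhd.rst lists the tensor classes in an autosummary block (the file's exact layout may differ), the addition might look like:

    .. autosummary::
        :toctree: generated/

        CGRTensor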

Zeldax64 added 5 commits May 29, 2025 12:50

  • Shorter syntax and more readable.
  • No need to check block size in CGR/MCR functions, as __torch_function__() in BaseMCRTensor already checks it.
  • Raise ValueError instead of RuntimeError.
  • Include CGRTensor in the README and in the built docs.
@Zeldax64 (Contributor, Author)

I've just pushed some commits. Regarding your comments:

Parameter checking

The use of assertions and RuntimeError stems from the current MCRTensor implementation. I just copied and pasted the code into BaseMCRTensor.

Do self and other require the same block_size for the similarity to make sense? If so, we should add the same error handling as in the bind method. However, it might be that the error handling in __torch_function__ already takes care of this, in which case there is no need for error handling in here or in bind

CGR and MCR operations (bind, bundle, and similarity) must always be performed on hypervectors sharing the same block_size. After reading your comments, I took a deeper look into the __torch_function__ calls, and I believe it is safe to remove the type checking from the bind() and bundle() functions, as __torch_function__ already checks the parameters (if anything, more often than necessary).

In fact, there might be a slight performance cost, since __torch_function__ is called for every PyTorch operation, including ones such as __get__ or shape that don't require block_size checking.
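
One possible mitigation, purely a sketch and not something this PR implements, is to only build and compare the set of block sizes when at least two hypervectors are present, so single-tensor calls stay cheap:

    # Hypothetical early exit inside __torch_function__ (not part of this PR):
    # comparing block sizes is only meaningful when two or more hypervectors
    # appear in args, so skip the set construction otherwise.
    tensors = [a for a in args if hasattr(a, "block_size")]
    if len(tensors) > 1:
        block_sizes = set(t.block_size for t in tensors)
        if len(block_sizes) != 1:
            raise ValueError(
                f"Call to {func} must contain exactly one block size, "
                f"got {list(block_sizes)}"
            )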

Documentation

Then all you have to do to include the CGRTensor in the documentation is to add it to docs/torchhd.rst. I would probably go for the second option.

After experimenting with this option, I feel that repeating function signatures isn't the way to go either, as the code is no longer DRY. Maybe we should let users check two doc pages and link CGR and MCR to BaseMCRTensor?

@mikeheddes (Member)

mikeheddes commented Jun 2, 2025

Thank you for the edits.

After experimenting with this option, I feel that repeating function signatures isn't the way to go either, as the code is no longer DRY. Maybe we should let users check two doc pages and link CGR and MCR to BaseMCRTensor?

When weighing user experience against code style, I would suggest we lean towards the better user experience over coding best practices. Said plainly, users don't care whether our code is DRY, but they do get annoyed if they have to read the documentation of one class in two places. So I would suggest we implement all the methods that need documentation on the child classes and have them call super.
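
Applied to CGRTensor, the suggested pattern would look roughly like this sketch (assuming multibundle is one of the methods that needs documentation; the docstring is placeholder text):

    class CGRTensor(BaseMCRTensor):
        def multibundle(self) -> "CGRTensor":
            """Bundle multiple hypervectors."""
            return super().multibundle()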
