metal-quant-ext2 is a repository for my research on PyTorch MPS kernel extensions written in Apple Metal. As the quant in the name implies, the focus is quantization.

The goal is to develop MPS PyTorch extensions for efficient local fine-tuning of Hugging Face models with PyTorch and TRL.
The study includes:
- the bitsandbytes MPS kernel: https://github.com/bitsandbytes-foundation/bitsandbytes/blob/49609323b0ae4c8222927a628641e9814289c8de/csrc/mps_kernels.metal
- the optimum-quanto MPS unpack code: https://github.com/huggingface/optimum-quanto/blob/caca3cc930050e31f04bfca753b5a93c3aa462aa/optimum/quanto/library/extensions/mps/unpack.mm
- the Metal sample code library: https://developer.apple.com/documentation/metal/metal-sample-code-library
Requirements:
- macOS 15.3.1 or later
- Python 3.12
- PyTorch
Installation:

```bash
pip3 install -r requirements.txt
pip3 install --ignore-installed .
```
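After installing, a quick sanity check that the extension imports and the MPS backend is available (the import name is taken from the usage example below):

```python
import torch
import metal_quant_ext2  # the compiled extension

assert torch.backends.mps.is_available(), "MPS backend not available"
print(torch.__version__, "MPS OK")
```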
`blockwise_quant` applies symmetric blockwise 8-bit quantization to a PyTorch tensor:
```python
import torch

from metal_quant_ext2 import blockwise_quant, dequantize

mps_device = torch.device("mps")
cpu_device = torch.device("cpu")
input_tensor = torch.randn(1024, device=mps_device, dtype=torch.float32)

# One scale/offset per block; block_size is assumed here and must match the kernel's.
block_size = 64
num_blocks = input_tensor.numel() // block_size

quantized = torch.empty_like(input_tensor, dtype=torch.int8)  # inherits the MPS device from input_tensor
scales = torch.empty(num_blocks, device=cpu_device, dtype=torch.float32)
offsets = torch.empty(num_blocks, device=cpu_device, dtype=torch.float32)

# the actual MTL call
blockwise_quant(input_tensor, quantized, scales, offsets)
print(f"quantized: {quantized}")
assert torch.all(quantized.cpu() >= -127) and torch.all(quantized.cpu() <= 127)

# Dequantize MTL call
scales = scales.to(mps_device)
output = torch.empty_like(input_tensor)
dequantize(quantized, scales, output)
```
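Assuming the kernel rounds to nearest, each dequantized element should sit within about half a quantization step (scale / 2) of the input; a loose round-trip check:

```python
# Round-trip error: each element should be off by at most about scale / 2.
err = (output - input_tensor).abs().max().item()
print(f"max abs round-trip error: {err:.6f}")
assert torch.allclose(output, input_tensor, atol=scales.max().item())
```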
Check out the test file with assertions: `test-blockwise-quant.py`
The Python script `code-samples/blockwise-quantization.py` helped me understand blockwise quantization.
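For intuition, here is a minimal pure-PyTorch sketch of the same scheme (my assumptions, not necessarily what the Metal kernel or the script does: per-block scale = max|x| / 127, round-to-nearest, clamp to [-127, 127], arbitrary block size):

```python
import torch

def quant_blockwise(x: torch.Tensor, block_size: int = 64):
    """Symmetric blockwise 8-bit quantization (pure-PyTorch reference)."""
    blocks = x.reshape(-1, block_size)                     # one row per block
    scales = blocks.abs().amax(dim=1).clamp(min=1e-8) / 127.0
    q = torch.clamp(torch.round(blocks / scales[:, None]), -127, 127).to(torch.int8)
    return q.reshape(x.shape), scales

def dequant_blockwise(q: torch.Tensor, scales: torch.Tensor, block_size: int = 64):
    """Invert the quantization: multiply each block by its scale."""
    blocks = q.reshape(-1, block_size).to(torch.float32)
    return (blocks * scales[:, None]).reshape(q.shape)

x = torch.randn(1024)
q, scales = quant_blockwise(x)
x_hat = dequant_blockwise(q, scales)
print("max abs error:", (x - x_hat).abs().max().item())   # bounded by scale / 2 per block
```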