Add group quantization for whisper #429

jimypbr · 2023-06-22T13:25:30Z

What does this PR do?

Adds group quantization custom op to optimum. Adds the functionality to whisper's linear layers.

Still to do:

Automate the custom op build into setup.py
Add quantisation options into IPUConfig or whisper parallelize kwargs.

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2023-07-12T12:55:19Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

…the moment.

setup.py

optimum/graphcore/quantization/group_quantize.py

jimypbr force-pushed the group_quantize branch from 5803044 to 85d0f0e Compare June 22, 2023 13:27

jimypbr force-pushed the group_quantize branch from 1c1d055 to da84fce Compare July 12, 2023 13:57

jimypbr added 10 commits July 13, 2023 14:53

Initial commit of group quantisation. works for whisper inference

8367b3f

Got it working with whisper. Custom ops have to be manually built at …

aa862dd

…the moment.

style fix

b5a09d4

Using V1 of the decompress custom op

eaa643f

Build now working in setup.py

6c5af4d

Remove prints

b519706

Using v1 of the codelet

099a031

Enable poplar and popart environments before installing optimum

ceae6a2

typo

deaffbe

Add debug print

53f30d4

jimypbr force-pushed the group_quantize branch from 0eb20b4 to 53f30d4 Compare July 13, 2023 14:54

jimypbr added 14 commits July 14, 2023 11:49

Fixed custom_ops path search. Debug prints still present.

a6bf0c7

Added explicit_ir_inference option to ipu_config

c7a4a40

Enable poplar/popart environment in code_quality workflow

7ea1e7e

Put black, ruff, and isort installs in github workflow

8110b26

Merge branch 'main' into group_quantize

04d0a1c

Add quotes around pip installs

75584e5

style

ed388a6

Add sdk version hash function

8f08ebe

Use cppimport instead of makefile

e0eb64b

Update setup.py

1c7a6d3

Add copyright and licenses

45ee6ef

Adding log info

b7821e1

Adding test that custom ops compile

117a13f

make style

6c08776

jimypbr marked this pull request as ready for review July 24, 2023 12:18

Revert code_quality workflow changes

f2c415f

jimypbr added 4 commits July 24, 2023 12:22

Revert changes to workflows

a4ad555

Revert changes to MANIFEST.in

7e33c5e

Remove commented out code

26d5142

Add custom_ops/utils.py

3bd465d

jimypbr requested a review from katalinic-gc July 24, 2023 12:27

katalinic-gc approved these changes Jul 24, 2023

View reviewed changes

setup.py Outdated Show resolved Hide resolved

Remove unused import

c0f232e

katalinic-gc reviewed Jul 24, 2023

View reviewed changes

optimum/graphcore/quantization/group_quantize.py Outdated Show resolved Hide resolved

use F.linear

df8cb83

jimypbr merged commit e101001 into huggingface:main Jul 24, 2023

jimypbr deleted the group_quantize branch July 24, 2023 14:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add group quantization for whisper #429

Add group quantization for whisper #429

jimypbr commented Jun 22, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Jul 12, 2023

Add group quantization for whisper #429

Add group quantization for whisper #429

Conversation

jimypbr commented Jun 22, 2023 • edited Loading

What does this PR do?

Before submitting

HuggingFaceDocBuilderDev commented Jul 12, 2023

jimypbr commented Jun 22, 2023 •

edited

Loading