peter bassill
· operator
about
work
writing
lighter side
talks
my cv
advisory
contact
×
Terminal
Dark
Light
$
grep -l "tag:ml" writing/
tag
:
ml
.
1 piece tagged
ml
, newest first. The full taxonomy is on the
tag index
.
2022·05·26
INT8 quantisation, in numbers — and why INT16 is the boring choice
What "INT8-quantised inference" actually means once you do the arithmetic, why dropping from FP32 to INT8 is a cliff and dropping to INT16 isn't, and why every interesting question about putting an ML model on real silicon ends up here.
ai · ml · quantisation · inference · hardware
11 min
→
all tags
·
all writing
~