Hacker News — vinext + Cloudflare Workers

▲Can gzip be a language model? (nathan.rs)

7 points by nathan-barry 11 hours ago | 1 comment

nathan-barry 11 hours ago [-]

LLMs are very good at lossless compression when paired with arithmetic coding. But I didn't know it was possible to go in the reverse direction (language modeling via a compressor). The quality isn't great, but I'm surprised it worked! Other compression algorithms (like PPMd) use variable-order n-gram models under the hood and should do much better (although that's less interesting, since they already contain a basic language model internally).
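
For anyone wondering what "language modeling via a compressor" could look like in practice, here is a rough sketch. It is not necessarily the method from the linked post. It assumes the usual trick of scoring each candidate next byte by how many extra compressed bytes it costs to append it to the context, then turning those costs into probabilities with a softmax. The names `compressed_len` and `next_byte_distribution`, and the `temperature` parameter, are made up for illustration. It uses `zlib` (the same DEFLATE algorithm as gzip, minus the file header).

```python
import math
import zlib
from typing import Dict


def compressed_len(data: bytes) -> int:
    """Size in bytes of the DEFLATE-compressed input at max compression."""
    return len(zlib.compress(data, 9))


def next_byte_distribution(
    context: bytes, candidates: bytes, temperature: float = 1.0
) -> Dict[int, float]:
    """Score each candidate byte by the extra compressed size it adds.

    Illustrative assumption: a cheaper continuation is a more likely one,
    so probability is proportional to exp(-extra_bytes / temperature).
    """
    base = compressed_len(context)
    # Iterating over a bytes object yields ints, one per candidate byte.
    costs = {c: compressed_len(context + bytes([c])) - base for c in candidates}
    # Subtract the minimum cost before exponentiating for numerical stability.
    cheapest = min(costs.values())
    weights = {c: math.exp(-(cost - cheapest) / temperature) for c, cost in costs.items()}
    total = sum(weights.values())
    return {c: w / total for c, w in weights.items()}


if __name__ == "__main__":
    context = b"the cat sat on the mat. the cat sat on the "
    dist = next_byte_distribution(context, b"abcdefghijklmnopqrstuvwxyz ")
    for byte, prob in sorted(dist.items(), key=lambda kv: -kv[1])[:5]:
        print(chr(byte), round(prob, 3))
```

Since the compressed size only changes in whole bytes, many candidates end up tied, which is one plausible reason the output quality is rough. Scoring longer candidate continuations instead of single bytes would give the compressor more to distinguish between.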

chinallm_ai 10 hours ago [-]

[flagged]
