• 0 Posts
  • 266 Comments
Joined 8 months ago
cake
Cake day: April 7th, 2025

help-circle














  • vivendi@programming.devtoMicroblog Memes*Permanently Deleted*
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    4 months ago

    One of the absolute best uses for LLMs is to generate quick summaries for massive data. It is pretty much the only use case where, if the model doesn’t overflow and become incoherent immediately [1], it is extremely useful.

    But nooooo, this is luddite.ml saying anything good about AI gets you burnt at the stake

    Some of y’all would’ve lit the fire under Jan Hus if you lived in the 15th century

    [1] This is more of a concern for local models with smaller parameter counts and running quantized. For premier models it’s not really much of a concern.


  • That is different. It’s because you’re interacting with token-based models. There has been new research on giving byte level data to LLMs to solve this issue.

    The numerical calculation aspect of LLMs and this are different.

    It would be best to couple an LLM into a tool-calling system for rudimentary numeral calculations. Right now the only way to do that is to cook up a Python script with HF transformers and a finetuned model, I am not aware of any commercial model doing this. (And this is not what Microshit is doing)