• 0 Posts
  • 907 Comments
Joined 10 months ago
cake
Cake day: February 10th, 2025

help-circle



  • iPhone notification summaries were made with GPT3.5 I believe (maybe even the -turbo version).

    It doesn’t use reasoning and so when using very short outputs it can produce wild variations since there are not a lot of previous tokens in order to direct the LLM into the appropriate direction in kv-space and so you’re more at the whims of temperature setting (randomly selecting the next token from a SOFTMAX’d list which was output from the LLM).

    You can take those same messages and plug them into a good model and get much higher quality results. But good models are expensive and Apple is, for some reason, going for the budget option.


  • Anyone learning a new language massively benefits from being able to speak with native speakers.

    That being said, LLMs are better at languages and translation tasks than any pretty much anything else. If you need vocabulary help or have difficulty with grammar they’re incredibly helpful (vs Googling and hoping someone had the same issue and posted about it on Reddit).

    I mean, if you can afford a native speaker tutor that is the superior choice. But, for the average person, an LLM is a massive improvement over trying to learn via YouTube or apps.






  • I don’t know the details or the bills, but it isn’t uncommon for people in contested seats to be allowed by the party to vote in a poll-favorable way when their vote won’t change the outcome.

    Simply, counting ‘Times voted with Trump’ doesn’t say much that is useful and can be misleading, especially in the context of a post about death threats to politicians.

    Social media can have a very us or them mentality. If you’re not 100% lock step with the group then you’re an enemy to be scored and attacked. I read that comment as 'Yeah, they’re getting death threats but they voted with Trump so they deserved it (<insert Nazi bar comment>)".



  • Thanks for the recommendation, I’ll look into GLM Air, I haven’t looked into the current state of the art for self-hosting in a while.

    I just use this model to translate natural language into JSON commands for my home automation system. I probably don’t need a reasoning model, but it doesn’t need to be super quick. A typical query uses very few tokens (like 3-4 keys in JSON).

    The next project will be some kind of agent. A ‘go and Google this and summarize the results’ agent at first. I haven’t messed around much with MCP Servers or Agents (other than for coding). The image models I’m using are probably pretty dated too, they’re all variants of SDXL and I stopped messing with ComfyUI before video generation was possible locally, so I gotta grab another few hundred GB of models.

    It’s a lot to keep up with.😮‍💨





  • Great article. It should be noted that your web browser allows websites to do this with javascript.

    Also, since this tracking isn’t really trying to hide, people have compiled lists of the dns names of the destinations of these requests.

    If you host a DNS server and you take that list and return NULL for everyone on the list. Then clients on your network who have ad tracking software will try to look up the destination to send your data and your DNS will tell them that there is nowhere to send the data and so the data isn’t delivered. So, all of the smart TVs, game consoles, refrigerators, toasters and doorknobs which automatically send data but you cannot configure will fail because they use DNS also.

    This sounds complicated to do, but I’m just describing Pi-hole (https://pi-hole.net/). It only takes a few minutes to setup the container and change your router’s DHCP configuration in order to give out the address of the Pi-hole DNS server.

    Assuming you’re on Linux, which you are because you’re a reader of a c/Privacy… right?