I’m just a nerd girl.

  • 3 Posts
  • 27 Comments
Joined 1 year ago
Cake day: March 4th, 2024

  • Rose@lemmy.world to Programmer Humor@programming.dev: choas
    English · 30 upvotes · 27 days ago

    Well, sure, with an image classifier, the bird identification is doable. I’m sure I could implement that if I went looking for some open source thingamabob that does that. But it’s still not something I could actually understand. That part definitely hasn’t changed over the years.

  • Rose@lemmy.world to Technology@lemmy.world: *Permanently Deleted*
    English · 62 upvotes · 3 months ago

    I have no idea why the makers of LLM crawlers think it’s a good idea to ignore bot rules. The rules are there for a reason and the reasons are often more complex than “well, we just don’t want you to do that”. They’re usually more like “why would you even do that?”

    Ultimately you have to trust what the site owners say. The reason why, say, your favourite search engine returns the relevant Wikipedia pages and not a bazillion random old page revisions from ages ago is that Wikipedia said “please crawl the most recent versions using canonical page names, and do not follow the links to the technical pages (including history)”. Again: Why would anyone index those?
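
For what it's worth, honouring those rules takes very little code. Here is a minimal sketch using Python's standard `urllib.robotparser`. The rules, the `ExampleBot` user agent, the `example.org` URLs and the helper name `may_fetch` are all made up for illustration; they are not Wikipedia's actual robots.txt, just an assumption about the kind of rules a wiki might publish.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: keep crawlers out of the technical
# (history, diff, edit) URLs under /w/, leave articles open.
EXAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /w/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def may_fetch(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True only if the site's rules allow this agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(EXAMPLE_ROBOTS_TXT)

# Canonical article page: allowed by the rules above.
article_ok = may_fetch(parser, "ExampleBot", "https://example.org/wiki/Python")

# Page history under /w/: disallowed, so a polite crawler skips it.
history_ok = may_fetch(
    parser, "ExampleBot", "https://example.org/w/index.php?title=Python&action=history"
)
```

A real crawler would download the site's own robots.txt (for example with `set_url()` and `read()`) and call something like `may_fetch` before every request, rather than hard-coding the rules like this.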

  • I run ad blockers. As a security measure. Ad companies collect an insane amount of data and do a bunch of shady stuff whenever they can get away with it.

    I want to support websites whenever I’m able, but the way ad companies operate just ain’t it.

    If they clean up their act, maybe then I could stop using ad blockers, but it’s been decades and I don’t have high hopes.

    I also use ad blockers for performance and usability reasons. For example, I used to use a bunch of Fandom wikis and couldn’t understand why people hated the UI. Then I saw what Fandom looks like without ad blockers and holy shit, how can humans live like this?