Why and How Does Python Use Bloom Filters in String Processing?
  • UTF-8 is an encoding for Unicode, which means it's a way of representing a Unicode string as actual bytes on a computer.

    It is variable length and works by using the leading bits of the first byte of each character to indicate how many bytes are needed to represent that character.

    Python also uses an encoding, as you describe in the article, but it's different to UTF-8. Unlike UTF-8, all characters in Python's representation of a Unicode string use the same number of bytes, which is the maximum that any individual character in the string needs.

    I'd probably mess up a more detailed explanation of UTF-8 or Python's representation, so I'll let you look into how they work in more detail if you're interested.
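
    A rough sketch of both points, assuming CPython (the `sys.getsizeof` measurements rely on an implementation detail and may differ on other interpreters). The helper names `utf8_sequence_length` and `per_char_width` are made up for illustration:

```python
import sys

def utf8_sequence_length(lead_byte: int) -> int:
    """Return how many bytes a UTF-8 sequence uses, judged from its first byte."""
    if lead_byte >> 7 == 0b0:       # 0xxxxxxx: single-byte (ASCII) character
        return 1
    if lead_byte >> 5 == 0b110:     # 110xxxxx: start of a 2-byte sequence
        return 2
    if lead_byte >> 4 == 0b1110:    # 1110xxxx: start of a 3-byte sequence
        return 3
    if lead_byte >> 3 == 0b11110:   # 11110xxx: start of a 4-byte sequence
        return 4
    raise ValueError("continuation byte or invalid lead byte")

def per_char_width(char: str, extra: str = "") -> int:
    """Estimate the bytes CPython uses per character of a string.

    Builds two strings that differ by 100 repeats of `char` and compares
    their sizes, so the fixed object header cancels out.
    """
    small = sys.getsizeof(char * 100 + extra)
    large = sys.getsizeof(char * 200 + extra)
    return (large - small) // 100

for ch in "aé€😀":
    encoded = ch.encode("utf-8")
    # Columns: character, UTF-8 length, length predicted from the first
    # byte, and the in-memory width per character of a string made of it.
    print(repr(ch), len(encoded), utf8_sequence_length(encoded[0]), per_char_width(ch))

# A single wide character widens every character in the string.
print(per_char_width("a"), per_char_width("a", extra="😀"))
```

    The last line is the part that differs from UTF-8: in UTF-8 the letter "a" always takes one byte, whereas in CPython's in-memory form its width depends on the widest character elsewhere in the same string.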

  • Why and How Does Python Use Bloom Filters in String Processing?
  • The article says that CPython stores strings UTF-8 encoded, which is not correct. The details it gives about how the representation works are right; it's just that the representation isn't UTF-8.

    That's just a minor point though, nice article.

  • How many people actually dropped Reddit for Lemmy?
  • Well I kept using it until Infinity died, which was only at the start of this month!

    If I do decide to go back, it will be by compiling the infinity APK with my own API key, but I'm not feeling much of an urge to bother at the moment.

  • After 6 hours
  • It probably really depends on the project, though I'd probably try and start with the tests that are easiest/nicest to write and those which will be most useful. Look for complex logic that is also quite self-contained.

    That will probably help to convince others of the value of tests if they aren't onboard already.
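
    As an illustration of the kind of target I mean, here is a minimal sketch. `parse_duration` is a hypothetical function invented for this example, not something from any particular project, and the test functions are written so that a runner such as pytest could collect them (assuming it is installed):

```python
import re

def parse_duration(text: str) -> int:
    """Convert a string such as '1h30m15s' into a number of seconds."""
    match = re.fullmatch(r"(?:(\d+)h)?(?:(\d+)m)?(?:(\d+)s)?", text)
    # The pattern alone would accept an empty string, so reject it explicitly.
    if not text or match is None:
        raise ValueError(f"invalid duration: {text!r}")
    hours, minutes, seconds = (int(group or 0) for group in match.groups())
    return hours * 3600 + minutes * 60 + seconds

def test_combines_units():
    assert parse_duration("1h30m") == 5400

def test_single_unit():
    assert parse_duration("45s") == 45

def test_rejects_garbage():
    for bad in ["", "abc", "30m1h"]:
        try:
            parse_duration(bad)
        except ValueError:
            continue
        raise AssertionError(f"{bad!r} should have been rejected")
```

    Logic like this has no database, network, or UI to set up, so the tests are quick to write and they catch the sort of edge cases that are easy to break by accident.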

  • Simply explained: how does GPT work?
  • I think calling it just like a database of likely responses is too much of a simplification and downplays what it is capable of.

    I also don't really see why the way it works is relevant to it being "smart" or not. It depends how you define "smart", but I don't see any proof of the assumptions people seem to make about the limitations of what an LLM could be capable of (with a larger model, better dataset, better training, etc).

    I'm definitely not saying I can tell what LLMs could be capable of, but I think saying "people think ChatGPT is smart but it actually isn't because <simplification of what an LLM is>" is missing a vital step to make it a valid logical argument.

    The argument relies on an incorrect intuition people have. Before seeing ChatGPT, I reckon if you'd told people how an LLM worked, they wouldn't have expected it to be able to do the things it can do (for example, if you ask it to write a rhyming poem about a niche subject, it won't have a comparable poem in its dataset).

    A better argument would be to pick something that LLMs can't currently do that they should be able to do if they're "smart", and explain the inherent limitation of an LLM which prevents it from doing that. This isn't something I've really seen, I guess because it's not easy to do. The closest I've seen is an explanation of why LLMs are bad at e.g. maths (like adding large numbers), but I've still not seen anything to convince me that this is an inherent limitation of LLMs.

  • How do I remove unnecessary python packages? At global and in vnev?
  • Yeah, my experience with Docker on Windows has been pretty bad: it uses high CPU and RAM at the best of times, and at the worst it completely hangs my computer at 100% CPU usage, with a restart as the only fix.

    I really don't understand why people are overcomplicating this. You can install multiple Python versions at once on Windows and it just works fine (you can use the py command to select the one you want).

    Virtual environments are designed exactly for this use case. They've got integrations for pretty much everything, they're easy to delete/recreate, they're really simple to use, they're fast, and they just work.

    If virtual environments alone aren't quite enough you can use something like poetry or pipenv or the many other package management options, but in many cases even that is overkill.
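
    To show how little is involved, here is a minimal sketch using the standard library's `venv` module from Python itself (most people would use the equivalent `python -m venv` command instead). The `create_env` and `delete_env` helpers and the temporary directory are just for illustration:

```python
import shutil
import tempfile
import venv
from pathlib import Path

def create_env(env_dir: Path) -> Path:
    """Create a virtual environment and return the path to its config file."""
    # with_pip=False keeps this sketch fast; pass with_pip=True to get pip too.
    venv.create(env_dir, with_pip=False)
    return env_dir / "pyvenv.cfg"

def delete_env(env_dir: Path) -> None:
    """Deleting an environment is just deleting its directory."""
    shutil.rmtree(env_dir)

# Use a throwaway directory so the example does not touch a real project.
workspace = Path(tempfile.mkdtemp())
env_dir = workspace / ".venv"

config_file = create_env(env_dir)
created = config_file.exists()

delete_env(env_dir)
deleted = not env_dir.exists()

shutil.rmtree(workspace)
print("created:", created, "deleted:", deleted)
```

    Because an environment is only a directory, getting rid of unwanted packages can be as simple as deleting it and creating a fresh one with just the packages you need.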

  • Python Security Developer-in-Residence – Weekly Report #2
    sethmlarson.dev Security Developer-in-Residence – Weekly Report #2

    This critical role would not be possible without funding from the OpenSSF Alpha-Omega Project. Massive thank-you to Alpha-Omega for investing in the security of the Python ecosystem! Second ...

    Featuring:

    • Identifying bundled libraries on Python distributions to help scanning for vulnerabilities.
    • Trusted Publisher adoption metrics.
    • GitHub push protection for PyPI API tokens.

    Lots of great stuff!

    GitLab Outage - c/DevOps
  • Thanks for the info on crossposting! I thought I'd seen someone mention a cross posting feature but couldn't see any button to do it. I'm using the Jerboa app on Android which I guess doesn't have that button, but I see it on the website now as you say.

    It's also good to know that linking to the original URL is generally better and the rest can be handled by the UI - that does seem nicer.

  • GitLab Outage - c/DevOps

    Cross Posting from DevOps, I hope this is the correct way of doing that and it's considered acceptable.

    GitLab Outage
    status.gitlab.com GitLab System Status

    This status page is exclusively intended for monitoring GitLab.com and its associated services. If there are any performance or service interruptions, an update will be posted here. If you are encountering service interruptions on GitLab Dedicated, please submit a support request.

    It seems to be back up now. I'm still waiting for information on the cause; so far the only explanation is the vague "config change" from the linked GitLab issue.

    qwop @programming.dev
    Posts 4
    Comments 49