• 8 Posts
  • 271 Comments
Joined 2 years ago
cake
Cake day: July 1st, 2023

help-circle
  • Ken Cheng is a great satirist and probably knows thats not how it works anymore. Most model makers stopped feeding random internet user garbage into training data years ago and instead started using collections of synthetic training data + hiring freelance ‘trainers’ for training data and RLHF.

    Oh dont worry your comments are still getting scraped by the usual data collection groups for the usual ad selling and big brother bs. But these shitty AI poisoning ideas I see floating around on lemmy practically achieve little more than feel good circle jerking by people who dont really understand the science of machine learning models or the realities of their training data/usage in 2025. The only thing these poor people are poisoning is their own neural networks from hyper focusing defiance and rage on a new technology they can’t stop or change in any meaningful way. Not that I blame them really tech bros and business runners are insufferable greedy pricks who have no respect for the humanities who think a computer generating an image is the same as human made art. Also its bs that big companies like meta/openAI got away with violating copyright protections to train their models without even a slap on the wrist. Thank goodness theres now global competition and models made from completely public domain data.


  • Which ones are not actively spending an amount of money that scales directly with the number of users?

    Most of these companies offer direct web/api access to their own cloud supercomputer datacenter, and All cloud services have some scaling with operation cost. The more users connect and use computer, the better hardware, processing power, and data connection needed to process all the users. Probably the smaller fine tuners like Nous Research that take a pre-cooked and open-licensed model, tweak it with their own dataset, then sell the cloud access at a profit with minimal operating cost, will do best with the scaling. They are also way way cheaper than big model access cost probably for similar reasons. Mistral and deepseek do things to optimize their models for better compute power efficency so they can afford to be cheaper on access.

    OpenAI, claude, and google, are very expensive compared to competition and probably still operate at a loss considering compute cost to train the model + cost to maintain web/api hosting cloud datacenters. Its important to note that immediate profit is only one factor here. Many big well financed companies will happily eat the L on operating cost and electrical usage as long as they feel they can solidify their presence in the growing market early on to be a potential monopoly in the coming decades. Control, (social) power, lasting influence, data collection. These are some of the other valuable currencies corporations and governments recognize that they will exchange monetary currency for.

    but its treated as the equivalent of electricity and its not

    I assume you mean in a tech progression kind of way. A better comparison might be is that its being treated closer to the invention of transistors and computers. Before we could only do information processing with the cold hard certainty of logical bit calculations. We got by quite a while just cooking fancy logical programs to process inputs and outputs. Data communication, vector graphics and digital audio, cryptography, the internet, just about everything today is thanks to the humble transistor and logical gate, and the clever brains that assemble them into functioning tools.

    Machine learning models are based on neuron brain structures and biological activation trigger pattern encoding layers. We have found both a way to train trillions of transtistors simulate the basic information pattern organizing systems living beings use, and a point in time which its technialy possible to have the compute available needed to do so. The perceptron was discovered in the 1940s. It took almost a century for computers and ML to catch up to the point of putting theory to practice. We couldn’t create artificial computer brain structures and integrate them into consumer hardware 10 years ago, the only player then was google with their billion dollar datacenter and alphago/deepmind.

    Its exciting new toy that people think can either improve their daily life or make them money, so people get carried away and over promise with hype and cram it into everything especially the stuff it makes no sense being in. Thats human nature for you. Only the future will tell whether this new way of precessing information will live up to the expectations of techbros and academics.


  • Theres more than just chatgpt and American data center/llm companies. Theres openAI, google and meta (american), mistral (French), alibaba and deepseek (china). Many more smaller companies that either make their own models or further finetune specialized models from the big ones. Its global competition, all of them occasionally releasing open weights models of different sizes for you to run your own on home consumer computer hardware. Dont like big models from American megacorps that were trained on stolen copyright infringed information? Use ones trained completely on open public domain information.

    Your phone can run a 1-4b model, your laptop 4-8b, your desktop with a GPU 12-32b. No data is sent to servers when you self-host. This is also relevant for companies that data kept in house.

    Like it or not machine learning models are here to stay. Two big points. One, you can self host open weights models trained on completely public domain knowledge or your own private datasets already. Two, It actually does provide useful functions to home users beyond being a chatbot. People have used machine learning models to make music, generate images/video, integrate home automation like lighting control with tool calling, see images for details including document scanning, boilerplate basic code logic, check for semantic mistakes that regular spell check wont pick up on. In business ‘agenic tool calling’ to integrate models as secretaries is popular. Nft and crypto are truly worthless in practice for anything but grifting with pump n dump and baseless speculative asset gambling. AI can at least make an attempt at a task you give it and either generally succeed or fail at it.

    Models around 24-32b range in high quant are reasonably capable of basic information processing task and generally accurate domain knowledge. You can’t treat it like a fact source because theres always a small statistical chance of it being wrong but its OK starting point for researching like Wikipedia.

    My local colleges are researching multimodal llms recognizing the subtle patterns in billions of cancer cell photos to possibly help doctors better screen patients. I would love a vision model trained on public domain botany pictures that helps recognize poisonous or invasive plants.

    The problem is that theres too much energy being spent training them. It takes a lot of energy in compute power to cook a model and further refine it. Its important for researchers to find more efficent ways to make them. Deepseek did this, they found a way to cook their models with way less energy and compute which is part of why that was exciting. Hopefully this energy can also come more from renewable instead of burning fuel.


  • Theoretically you may be able to store the core seed information that encodes the starting constants that lead to the beginning of the universe. Its not really the same thing like the difference between a cake and the recipe used to make it. Information systems can be distilled to core seed equations and regenerated by iterating that equation many times. This is Barnsley’s collage theorem.




  • This meme was originally made for the !netsphere@sopuli.xyz community in an attempt to give a super niche manga artist fan place a little bit of engagement, I crossposted it here as an afterthought didn’t expect go be brigaded so hard by armchair memologist over the objective definition and location of the funny.

    You’re absolutely correct that you need to have read the Blame! manga to get the reference on this one to really enjoy, even if you did its not that deep. Not too much thought went into it I was high as shit just pasting icons with the ‘linux chad big energy beam, windows/microsoft wojak bad guys its fired at’. Im personally okay with not every one of my memes being super accessible or community bangers I had fun making this and putting the template together. If you get the humor or like the template great. If you don’t, oh well downvote say ‘where funny’ and move on with your life cause im not wasting my time explaining what a graviational beam emitter is to snobs who don’t care in the first place.




  • If money isnt a big issue and you want something truly beefy for a solar system I would recommend something like this then. Your solution is essentially a usbc-pd car charger without the ability to remove from a cigarette plug. You would achieve the same affect wiring up a female cigarette car plug and buying a regular pd car charger with the bonus of being able to swap the outlet out for other 12v car plugs as needed.

    If you want an integrated charger thats fine though at the end of the day theyre all just fancy variable dc to dc converters that take in 12-24v and pop out the usbc-pd voltage ranges as rated. just wanted to give you some options.

    Im a electical engineer and made my own 200w solar system. I feel your pain had to mcguyver some stuff to run off usbcpd. LMK if you want to talk shop. Related guide I wrote explaining USBC-pd and dc-to-dc on lemmy




  • Okay I think the term ‘foot-gun’ is supposed to evoke the image of someone loading a gun and pointing it at their own foot. I can’t help trying to picture a gun thats operated by a foot. Like a mech suit with a robot leg that also fires massive tank shattering shells when you do a roundhouse kick as a human operator. Or a veteran prosthetic leg that’s also a rifle when you kick it the right way.

    The brain rot seeps just a little bit more every time I see the term ‘foot-gun’ please help.







  • SmokeyDope@lemmy.worldtoFound Satan@lemmy.worldTrolling fit bros
    link
    fedilink
    English
    arrow-up
    3
    arrow-down
    2
    ·
    edit-2
    25 days ago

    But is it a figurative burying or a meta-literal one? I mean if you really think about things entomologically and we pick apart the Latin root words of “bury” and “dogpile” we might just find that the meaning of lemmy dogpile changes completely depending on context, literally figuratively.

    Language is fucked. We really need telepathically beaming abstract concepts directly into brain matter so I don’t have to crawl through linguist brainrot reply chains.



  • “Look, I didn’t spend months hyperobsessing on /x/, digging through thousands of succubus tupla summoning post and self-inflicting disassociative psychosis to vizualize a vampire GF only to deal with my problems in a healthy normie sheeple NPC way. Fuck that. I mean how would I even tell them about the goat sacrifices? Oh I can still hear the bleeting sometimes. Now shut up, Lumali is warning me about the reptillians under the flat hollow earth slowly replacing my loved ones with robot clones to spy on me again.”


  • SmokeyDope@lemmy.worldtoFunny@sh.itjust.worksDefine Greenwashing
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    2
    ·
    edit-2
    26 days ago

    Im an extremest “No the glass is at precisely half capacity. I meticulously found a geometric proof to show its the fill line is exactly at half of the glass. I counted the fucking molecules one at a time and statistically compared them with a full glass and a empty glass. Its half, no more no less. Both of your perceptions of the world are deeply warped by internal biases.” Kind of guy myself. Or in Lemmy speak, a FiLtHy CeNtaRist!!!