Monday, December 23

Geek

Daily News Stuff 23 December 2024

Again Dangerous Frisbees Edition

Top Story

  • OpenAI's next generation model, GPT-5, is ahead of schedule and coming in under budget. (WSJ / MSN)

    Sorry, just kidding. GPT-5 is not working, may never work as planned, and each training run takes six months and costs half a billion dollars.
    OpenAI has conducted at least two large training runs, each of which entails months of crunching huge amounts of data, with the goal of making Orion smarter. Each time, new problems arose and the software fell short of the results researchers were hoping for, people close to the project say.
    Also there's the tiny problem that with GPT-4, OpenAI already looted the entire public internet. GPT-5 needs a lot more data for its training, and there isn't more data.
    OpenAI’s solution was to create data from scratch.

    It is hiring people to write fresh software code or solve math problems for Orion to learn from. The workers, some of whom are software engineers and mathematicians, also share explanations for their work with Orion.

    But, you say, the internet contains all human knowledge. Won't trying to expand that significantly take a long time? Won't it cost a huge amount of money?

    Yes.
    The process is painfully slow. GPT-4 was trained on an estimated 13 trillion tokens. A thousand people writing 5,000 words a day would take months to produce a billion tokens.
    What about using AI to train your new AI?
    OpenAI also started developing what is called synthetic data, or data created by AI, to help train Orion. The feedback loop of AI creating data for AI can often cause malfunctions or result in nonsensical answers, research has shown.


    Scientists at OpenAI think they can avoid those problems by using data generated by another of its AI models, called o1, people familiar with the matter said.
    Scientists at OpenAI are paid to think that. They are paid a lot to think that.

    In short, your job is safe for now.

Tech News



Disclaimer: Mostly dead is still partly alive.

Posted by: Pixy Misa at 06:00 PM | Comments (4) | Add Comment | Trackbacks (Suck)
Post contains 547 words, total size 4 kb.

1 Morpheus had it wrong in The Matrix: the humans weren't batteries, they were content generators.

-j

Posted by: J Greely at Monday, December 23 2024 07:06 PM (oJgNG)

2 Human brains in jars being harvested for their dreams . . . was that a Borges short story?

Posted by: normal at Monday, December 23 2024 09:39 PM (bg2DR)

3 "researchers chose carbon-14 as the source material because it emits short-range radiation, which is quickly absorbed by any solid material"

Ah, yes that elusive "short-range radiation".  It's a website with the word "science" in the URL, so obviously they can't use a term like "beta decay" to describe something.

Posted by: normal at Monday, December 23 2024 09:44 PM (bg2DR)

4 "Having 10 years experience in 2nm lithography is a good start."
Don't get me started.

Posted by: Rick C at Tuesday, December 24 2024 12:29 AM (NEIix)

Hide Comments | Add Comment




Apple pies are delicious. But never mind apple pies. What colour is a green orange?




53kb generated in CPU 0.0129, elapsed 0.107 seconds.
58 queries taking 0.0989 seconds, 351 records returned.
Powered by Minx 1.1.6c-pink.