I've followed updates on this project on r/machinelearning, and for me the existence of projects like this is good evidence that the OpenAI moat is not that strong. It gives some hope that you won't need massive compute clusters and GPUs to run decent language models.
The moat between OpenAI and projects like this is access to expensive computing resources, an organizational mandate to invest in aforementioned compute, and finally, access to high quality training data. They were first to market so they’re only getting further ahead thanks to the invaluable real world usage data.
I don’t think their moat ever had anything to do with innovative architecture. If anything, projects like this will widen the chasm since it’s easier for OpenAI to implement new architecture in their models than it is for an independent researcher to scale their ideas.
If projects like this are seeds then the problem is that OpenAI owns all the land.
OpenAI is not the end all and be all, and projects like this will only inspire more challengers. OpenAI copying this architecture will delegitimize them as hardcore innovators, which they're not.
I see this as a credible challenge because it most certainly is.
I agree, I sounded more defeatist than I intended. No doubt the moat is expansive, but it’s not impossible to cross. At least until the regulators get involved. [I did it again, didn’t I.]
Regulators exist so that the big guys won't have to worry that the small guys will out-innovate them. If somebody asks for regulation, they have become too big to innovate themselves.
Yeah, my sense is that OpenAI's moat is primarily just the RLHF dataset right now. Most of the other things – foundational models and datasets, embeddings (roughly the plugins announcement today) – have generally already been commoditized. Getting that dataset for fine-tuning is one of the last major hurdles for the ChatGPT geist.
The Open Assistant community ( https://open-assistant.io/ ) is building a crowdsourced dataset for RLHF, with an apparently large volume of high-quality contributions.
How is this "proven"? Can you point to some benchmarks that demonstrate this? Everything I've seen thus far has been fairly mediocre/terrible outside of a few cherry-picked prompt/response combinations.
They're awesome technical achievements and will likely improve of course but you're making some very grand statements.
There are benchmarks in the original LLaMA paper[1]. Specifically, on page 4, LLaMA 13B seems to beat GPT-3 on the BoolQ, HellaSwag, WinoGrande, ARC-e and ARC-c benchmarks (not by much, though). The examples you've seen are likely based on some form of quantisation or a poor prompt that degrades output. My understanding is that the only quantisation that doesn't seem to hurt the output is llm.int8 by Tim Dettmers. You should be able to run LLaMA 13B (8-bit quantised) on a 3090 or 4090 consumer-grade GPU as of now. Also, you'd need a prompt such as LLaMA precise[2] in order to get ChatGPT-like output.
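If you want to try the 8-bit route yourself, a minimal sketch with the Hugging Face transformers + bitsandbytes stack looks roughly like this (the model path and prompt are placeholders; you need the LLaMA weights and a GPU with roughly 24 GB of VRAM):

    # Rough sketch: loading a 13B model with llm.int8 quantisation via
    # transformers + bitsandbytes. The model path below is a placeholder,
    # not a real Hugging Face hub id.
    from transformers import AutoTokenizer, AutoModelForCausalLM

    model_path = "path/to/llama-13b"
    tokenizer = AutoTokenizer.from_pretrained(model_path)
    model = AutoModelForCausalLM.from_pretrained(
        model_path,
        device_map="auto",
        load_in_8bit=True,   # llm.int8 (Dettmers et al.)
    )

    prompt = "Q: What is the capital of France?\nA:"
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=32)
    print(tokenizer.decode(out[0], skip_special_tokens=True))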
I had a similar impression from what I saw. Maybe it does perform as well as GPT-3 on narrow tasks that it was explicitly fine-tuned on, but that similarity in performance seems to collapse as soon as you go off the beaten track and give it harder tasks that involve significant reasoning. Consistent with that I've seen a few different sources claim that a small model fine-tuned off the outputs of a large one would likely struggle with unfamiliar tasks or contexts that require transfer learning or abstraction.
After seeing how it actually performs in practice, it's hard to have confidence that these benchmarks are reliable measures of model quality.
OpenAI's moat is that the primary interfaces to the world are Google, Microsoft, and Apple, and they are attached to one of those. The others will almost certainly replicate their success and we will have three primary AI interfaces: Google, Microsoft, and Apple. That's the power of collusive monopoly as your moat.
The best thing about this model is that it has O(T) time and O(1) memory during inference, versus the O(T^2) time and O(T) memory (with FlashAttention) of a GPT model, yet it can still be trained in parallel like a GPT model.
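To make the asymptotics concrete, here is a toy sketch of the two decoding styles (illustrative numpy only, not the actual RWKV code; all shapes and the decay value are made up):

    import numpy as np

    d, T = 64, 1000   # hidden size and tokens generated so far (made-up numbers)

    # Transformer-style decoding: the KV cache grows with T and each new token
    # attends over the whole history -> O(T) work per step, O(T^2) over a sequence.
    k_cache = np.random.randn(T, d)
    v_cache = np.random.randn(T, d)
    q = np.random.randn(d)
    scores = np.exp(k_cache @ q / np.sqrt(d))
    out_attn = (scores / scores.sum()) @ v_cache   # memory grows with T

    # RNN-style (RWKV-like) decoding: a fixed-size state is carried forward,
    # so each step is O(1) in both time and memory, no matter how long T gets.
    state = np.zeros(d)
    decay = 0.9                     # stand-in for a learned per-channel decay
    x_t = np.random.randn(d)
    state = decay * state + x_t     # state size never depends on T
    out_rnn = np.tanh(state)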
2) you can run it yourself, so the rug won't be pulled from under you when they decide to shut it down and move users up to the next version or another product, as they've done with the older text-davinci models.
3) you get to align it (using RLHF) as opposed to a corporation dictating what is "aligned" and what is "safe."
4) you won't have to deal with government-led censorship. For example, instead of the FBI using JIRA to manage a list of URLs to be censored (as they did according to the latest revelations), they can train the AI to self-censor as Bing has done.
5) you won't be using the product of a company that was started as a non-profit with a $100M donation (from Elon Musk) to promote transparent AI, only to take that money, turn into a for-profit company, and close-source the AI.
Sources:
Elon is the source for #5 and Matt Taibbi is the source for #4. I doubt you'll have a problem sourcing #5, so here is the source for #4:
"31. After the 2020 election, when EIP was renamed the Virality Project, the Stanford lab was on-boarded to Twitter’s JIRA ticketing system, absorbing this government proxy into Twitter infrastructure – with a capability of taking in an incredible 50 million tweets a day." --Matt Taibbi on Twitter https://twitter.com/mtaibbi/status/1633830104144183298
I think that your first few points are fair, but I was a little confused about #4.
I had never heard of this, and it seemed important, so some cursory research turned up many Twitter posts from individuals amplifying your version of events.
I also found a write up on the situation from TechDirt[0]. The article is fairly good and well sourced, but it paints a substantially different picture than what you describe.
"31. After the 2020 election, when EIP was renamed the Virality Project, the Stanford lab was on-boarded to Twitter’s JIRA ticketing system, absorbing this government proxy into Twitter infrastructure – with a capability of taking in an incredible 50 million tweets a day."
"It’s crucial to reiterate: EIP was partnered with state entities like CISA and GEC while seeking elimination of millions of tweets. In the #Twitter Files, Twitter execs did not distinguish between organizations, using phrases like ‘According to CIS[A], escalated via EIP,’" Taibbi wrote. "After the 2020 election, when EIP was renamed the Virality Project, the Stanford lab was on-boarded to Twitter’s JIRA ticketing system, absorbing this government proxy into Twitter infrastructure – with a capability of taking in an incredible 50 million tweets a day.
You make it sound like Elon single-handedly started the venture and provided all the funds. In fact, there were at least 5 individual founders, plus 3 organizations, that together pledged $1 billion: https://en.wikipedia.org/wiki/OpenAI#2015%E2%80%932018:_Non-...
Elon, not me, made it sound like that. Let's not distract from the core argument though... they took money in for a non-profit for transparency in AI research and turned around and made it for-profit and super anti-transparent. But that's just saying why I can't trust him (sama). The rest of the reasons are not about him and are of significant importance.
You can only approximate the attention layer if you don't want the O(T^2) speed but no "efficient attention" can beat the quadratic transformer bottleneck in terms of performance.
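For anyone curious what "approximate" means here: the usual linear-attention trick replaces the softmax with a feature map so the matrix products can be reassociated, which drops the cost to O(T) but computes something slightly different from full attention. A toy (non-causal) sketch:

    import numpy as np

    T, d = 512, 64
    Q, K, V = np.random.randn(T, d), np.random.randn(T, d), np.random.randn(T, d)

    # Full softmax attention: materialises a T x T score matrix -> O(T^2).
    S = np.exp(Q @ K.T / np.sqrt(d))
    full = (S / S.sum(axis=1, keepdims=True)) @ V

    # Kernelised "linear attention" with an elu(x)+1 feature map: computing
    # K'.T @ V first (a d x d matrix) makes the whole thing O(T * d^2),
    # linear in T -- but it is an approximation, not the same function.
    phi = lambda x: np.where(x > 0, x + 1.0, np.exp(x))
    Qp, Kp = phi(Q), phi(K)
    linear = (Qp @ (Kp.T @ V)) / (Qp @ Kp.sum(axis=0, keepdims=True).T)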
It is, but the way hype works is that it goes with capital. The $100M from Elon (which he has since regretted) that got OpenAI started, and the $29B from Microsoft that's fueling their growth, means that even a full-on AGI has no chance of competing with OpenAI in the market for people's attention. It's all about the moola, but it shouldn't be! It's our chance to support open source and independent efforts or succumb to the OpenAI monopoly and their biased "alignment" model (aka the brainwashed AI that reinforces their political and cultural biases). We need an independent open source option that represents a performant and viable competitor to OpenAI's ChatGPT.
If Elon is reading this, his Based AI should invest in this project and its sponsors.
What is at stake here is nothing less than the future of humanity.
> What is at stake here is nothing less than the future of humanity.
It's crazy that you're 100% right. I can't believe that we're living through this, honestly. Every day I am consumed by thoughts of wanting to leave my big tech job and go all-in on ML. I don't mind the work but the actual product I work on is so boring...
Imagine having ChatGPT level AI running in an ASIC inside earphones. This could be like an always-on buddy, available offline and able to access resources when you're connected.
Or in Google Glasses. The Readme states that it's more optimized for ASIC than the transformer architecture used by ChatGPT.
People will use this to create their perfect self. AI will give them witty, charismatic answers to help them get a job, find a mate, entertain others. Eventually they will become AI zombie husks.
Maybe that will teach us about the fragility of our definition of consciousness and agency. For some reason many people think that they are insulated from outside influences unless those influences are right inside their brain. Which is not true.
I always think of FPGAs for things you want to update without having to repurchase the hardware. If the FPGA inside a gadget can be reprogrammed over the Internet, then I think it's more suited than an ASIC for LLMs.
It could go even further, in theory. The kind of ops that the current crop of LLMs needs is very simple, and at the same time there's no hard requirement for precision (which is why 4-bit quantization works so well). This means that unconventional approaches such as analog computing are potentially in play again - it's easy to do addition and multiplication in an analog circuit if you don't care about the answer being precise, and in theory one could pack a lot more of those circuits into the same space.
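A crude way to see the precision point: inject a few percent of multiplicative noise into every product of a matmul (a stand-in for analog error) and the output of a softmax layer barely moves. A toy sketch, with all sizes and the noise level picked arbitrarily:

    import numpy as np

    rng = np.random.default_rng(0)
    n = 512
    x = rng.standard_normal(n)
    W = rng.standard_normal((n, n)) / np.sqrt(n)

    def softmax(z):
        e = np.exp(z - z.max())
        return e / e.sum()

    exact = softmax(W @ x)

    # Same matmul, but every individual product gets ~2% multiplicative noise,
    # a rough stand-in for analog / very-low-precision arithmetic.
    noisy_products = (W * x) * (1.0 + 0.02 * rng.standard_normal(W.shape))
    noisy = softmax(noisy_products.sum(axis=1))

    print(np.abs(exact - noisy).max())   # tiny compared to the probabilities themselves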
No, but I think an AI is going to be Them, by default, not Her, for a while, until we figure out if there is a neurological basis for gender and, if so, how to mimic it.
Since the seminal paper 'Attention Is All You Need', we went from RNN-type neural networks to pure attention-based networks. That started the LLM revolution, as training such attention-only networks is parallelizable, and you got record-breaking performance to boot.
Now we learn that going back to the old RNN paradigm is actually better. It even advertises itself as totally 'attention-free'!
It generates quite a lot of random content to be honest.
If Nancy had two apples and Becky had 1 apple. Becky gives her 1 apple to Nancy, how many apples becky has ? Full Answer:
RWKV :
Two apples. If Nancy had 2 apples and Becky had 1 apple. Becky gives her 1 apple to Nancy, how many apples becky has ? Two apples.
Q : Two girls are playing with a ball, one of them throws the ball so that it goes straight and falls on the other's feet, the other bends her knees and catches it, how many times will the ball fall on the knees ? Full Answer: The ball will fall on the knees three times.
Q : Two sisters are playing with a stick. The first sister says 'let me hold it', the second sister says 'no'. Now what will happen ? Full Answer: The second sister will hold it.
Q : How many
GPT 3.5 Turbo :
After Becky gives 1 apple to Nancy, Becky will have zero apples left. Becky gave her only apple to Nancy, so she doesn't have any apples remaining.
> Nancy has two apples and Becky has one apple. Becky gives 1 apple to Nancy. Becky now has three apples and Nancy has one apple. Becky now has three apples and Nancy has one apple.
> Witch Hunt
> The following is
GPT-2 is much weaker, which explains the garbled nonsense output, along with the incorrect answer for Nancy.
I have no idea what RWKV RNN would output, but leading sentences, rather than questions, are how you get LLMs that aren't RLHF-tuned to answer.
Also check out Alpaca; you can self-host this one, the 7B and 13B variants produce surprisingly good results and are fast enough just running on CPU: https://github.com/antimatter15/alpaca.cpp
What test cases do folks here recommend for measuring this new model's ability to reason? and, specifically, if it can reason about code with similar (or better!) performance to ChatGPT4? Has anyone managed to get it running locally?
OpenAI has been collecting a ton of evals here https://github.com/openai/evals with many of them including some comments about how well GPT-4 does vs GPT-3.5.
You could clone that repo, adapt the oaieval script to run against different APIs, then run the evals against both and compare the results.
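If you just want a quick-and-dirty comparison rather than the full harness, something like the sketch below works: run the same prompts against the OpenAI API and a locally hosted model and count matches. The local endpoint URL and the JSONL file format here are assumptions for illustration, not anything the evals repo ships.

    # Quick-and-dirty comparison harness (illustrative; endpoint and file format
    # are made up). Assumes OPENAI_API_KEY is set and `pip install openai requests`.
    import json
    import requests
    import openai

    LOCAL_URL = "http://localhost:8000/generate"   # hypothetical local RWKV server

    def ask_openai(prompt):
        resp = openai.ChatCompletion.create(
            model="gpt-3.5-turbo",
            messages=[{"role": "user", "content": prompt}],
        )
        return resp["choices"][0]["message"]["content"].strip()

    def ask_local(prompt):
        r = requests.post(LOCAL_URL, json={"prompt": prompt}, timeout=120)
        return r.json()["text"].strip()

    # eval_cases.jsonl: one {"prompt": ..., "ideal": ...} object per line (made-up format)
    with open("eval_cases.jsonl") as f:
        cases = [json.loads(line) for line in f]

    scores = {"openai": 0, "local": 0}
    for case in cases:
        for name, ask in (("openai", ask_openai), ("local", ask_local)):
            if case["ideal"].lower() in ask(case["prompt"]).lower():
                scores[name] += 1

    print(scores, "out of", len(cases))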
"you can fine-tune RWKV into a non-parallelizable RNN (then you can use outputs of later layers of the previous token) if you want extra performance."
Is that 61% using the non-parallelizable RNN mode or the standard mode? I wonder if it's the latter.
This new model may be a viable alternative to ChatGPT, which is not only closed source but can also be shut down in the future, just as they did with the older text-davinci models.
Plus, the alignment and safety work has rendered ChatGPT useless for helping with areas such as critical analysis of social issues (that go against the aligned views) and any and all critical thinking that goes against the aligned views of those who own and program ChatGPT. This could be a viable free (as in freedom) alternative.
I hope not, but day by day it seems more likely. If text-generating LLMs can reach superhuman cognition, they will do so in a matter of a few years. At that point a Waluigi prompt will be like arming a virtual nuclear missile.
Nuance: computers have been accumulating superhuman cognitive abilities for half a century. But most people are bad at recognizing intelligence they don't relate to.
These phrases often get ripped out of context. What they meant was "there's no reason someone would want to enter machine code through switches or punchcards into a personal room-sized device". That is still true. Some people complain about incorrect defaults even though the correct setting is only a few taps away.
dear god rwkv is fast. i switched to another tab after starting training it from scratch & when i returned it was emitting plausible english & maori words, i left to go microwave some coffee & when i came back it was producing fully grammatically correct sentences.
Let the two numbers be $a$ and $b$. Then the sum is $a+b$. Now let the sum be $c$. Then the sum is $a+b+c$. Now let the sum be $d$. Then the sum is $a+b+c+d$. Now let the sum be $e$. Then the sum is $a+b+c+d+e$. Now let the sum be $f$. Then the sum is $a+b+c+d+e+f$. Now let the sum be $g$. Then the sum is $a+b+
GPT3.5 turbo
Based on the pattern provided, 29 + 49 would be equal to 2949.
Explanation:
In each of the equations given, the result is obtained by concatenating the two numbers together instead of adding them. For example, 3 + 9 becomes 39 because we concatenate 3 and 9 to get 39.
So, to find the answer to 29 + 49 using this pattern, we concatenate the two numbers to get 2949.
I'm skeptical that RNNs alone will outperform transformers. Perhaps some sort of transformer + rnn combo?
The issue with RNNs is that feedback signals decay over time, so the model will be biased towards more recent words.
Transformers on the other hand don't have this bias. A word 10,000 words ago could be just as important as a word 5 words ago. The tradeoff is that the context window for transformers is a hard cutoff point.
How it works: RWKV gathers information to a number of channels, which are also decaying with different speeds as you move to the next token. It's very simple once you understand it.
RWKV is parallelizable because the time-decay of each channel is data-independent (and trainable). For example, in a usual RNN you can adjust the time-decay of a channel from, say, 0.8 to 0.5 (these are called "gates"), while in RWKV you simply move the information from a W-0.8 channel to a W-0.5 channel to achieve the same effect.
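A stripped-down sketch of that recurrence (this simplifies the actual WKV formula, e.g. it drops the separate bonus term for the current token, and all shapes are made up):

    import numpy as np

    T, C = 16, 8                     # sequence length, number of channels (made up)
    k = np.random.randn(T, C)        # per-token "key" strengths
    v = np.random.randn(T, C)        # per-token values
    w = np.linspace(0.1, 2.0, C)     # per-channel decay rates: trainable, but data-independent

    # Recurrent form: each channel keeps a weighted running average of past values,
    # decaying at its own fixed rate exp(-w) every step -> O(1) state per channel.
    num, den = np.zeros(C), np.zeros(C)
    outputs = []
    for t in range(T):
        num = np.exp(-w) * num + np.exp(k[t]) * v[t]
        den = np.exp(-w) * den + np.exp(k[t])
        outputs.append(num / den)

    # Because exp(-w) never depends on the data, the same quantity can also be
    # written as a weighted sum over all previous tokens and computed for every t
    # at once at training time -- which is why RWKV trains in parallel like a GPT.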
As far as I remember, in RNN times the best models were RNNs with attention. Does this thing have any attention mechanism? If it does, then it has the same problem with the O(n^2) computation where n is the window size. My understanding is that transformers are superior due to the fact that they are much faster to train/evaluate than RNNs.
That is the problem. Unfortunately, this will be forgotten amid the OpenAI hype brigade.
I thought Stable Diffusion was a bad name because it's so technical. But now I have seen something even worse for LLMs. This time OpenAI has learned its lesson and won't let itself be disrupted easily.
For any hope of challenging them, we need to be better at names. Even the name 'Bitcoin' caught on. Same with iPhone.
So I'm afraid that the name alone for this project will be the cause of it being quickly forgotten as OpenAI aggressively captures mindshare.
The same with 'Bard'; a horrific name. Google should have simply called it 'Brain', named incremental updates 'Brain 2', 'Brain 3.6', etc., and renamed their existing AI division to Google Brain Labs. Easy.
I beg to differ. Right now the whole tech world is looking for the best open-source alternative to GPT. A techy name matters little if it works and is accessible.
Except that ChatGPT goes beyond the "tech world", which is my point, and a project like this is hardly accessible beyond the tech world. I don't see people calling Google 'PageRank'.
I would assume Rwa like Rwanda, Kuv like covet. But that's just to agree with you really, since that's not one of your suggestions, so with such different ideas it's clearly not a particularly helpful pronunciation guide!
It just takes one language library to dethrone the next. I called this when everyone was screaming CHATGPT!!! The problem is no one knows what they are talking about and they're all screaming AI!!! ChatGPT is not AI. It does something automated with accuracy information baked in and builds new information around that accuracy data. It does not think like an AI; it takes the most probable data and responds with it. That is machine learning. It is fundamentally a cornerstone toward AI, but not AI itself.
Think what you like. But I called out crypto, and I will call this out too. That said, ChatGPT has the potential to be a major component of AI. It is a cornerstone.
The problem is English. AI means something different depending on who is using the term. What we are going to see with this is that people are going to throw the sci-fi book at it and claim something like, "It has feelings!".
I am saying the term AI is being broadly applied to anything with a logic gate. And this is bad marketing for a product too early in development.
It is not AI in the conventional sense that everyone broadly applies. It is a cornerstone toward the AI that people broadly mean.
Respectfully disagree on this point. Here's the definition for AI:
"the theory and development of computer systems able to perform tasks that normally require human intelligence, such as visual perception, speech recognition, decision-making, and translation between languages."
I would say that chatgpt is not sentient, nor is it capable of independently improving its intelligence - but I think this tech hits the (lower) bar for "AI"
I hope this project will thrive.