Quote from: A on June 04, 2024, 19:21:04But for basic tasks like text to voice
Everything is exactly the opposite - the key basic task for the average consumer from "AI" is voice to text. And even in a difficult noise environment.
Or on-the-fly translation of a conversation or "dubbing" into a movie with perfect preservation of the intonation (including the sound atmosphere with reverberations) of the original actors and the exact semantic correspondence of the text to the original. And this does not take into account jokes, which are often understandable only to speakers of the particular language in which the film was created, which requires additional intellectual effort from the "AI" to understand what analogue of a joke or sarcasm in one language can correspond in another language. These are real "super" tasks, although they are banal. And the text in voice - who needs that anyway? Besides the blind?
Well, what SoC is capable of doing this today with a minimum level of errors in any language on the planet?
The correct answer is that even the super-computers in the Top 10 are not capable of this. Not to mention the ridiculous models that fit into children's NPUs of consumer SoCs.
I repeat once again - the modern fashion for "AI" is nothing more than a convenient reason for increasing hardware prices, because even large data centers with giant networks that are several orders of magnitude superior in performance to an ordinary SoC are not able to solve this seemingly "banal" problem. household tasks.
It will take at least another 50-70 years before the average consumer actually gets in their pocket something similar to something that is capable of solving the above 2 banal everyday problems from the point of view of the average person.
The capabilities of ordinary hardware must increase by hundreds of millions of times + thousands of new technologies at all levels in order for such solutions to become possible on a mass scale. And this will really change civilization - when language barriers are almost completely erased, and communication with the machine will be in an ordinary voice (and then at the level of transmitting mental commands).
It's like trying to build a household refrigerator 1000 years ago. Even understanding the goal - without thousands of related technologies and industrial chains, this was impossible.
Even rifled firearms became possible only after the advent of hundreds of other technologies.
"Any sufficiently advanced technology is indistinguishable from magic" (c) A. C. Clarke