+ Journal

    + News

    + Projects

    + Academia

    + People

    + Backgrounds

    + Forum

    + About

    - Recent Articles

    Escape From the Future: Architecture, Language, and the Computational Turn

    Figure 4.pngThe creators of this online journal and forum controversially argue that computation will engender the final stage of development in the relationship between architecture and computers by ...

    Process/Drawing

    ReasThumb.jpgWriting software is at the core of Casey Reas’s artistic practice. The digital is his medium of choice rather than a means of manipulation. He reflects on ...

    Metaphysics of Genetic Architecture and Computation

    Thumb copy.jpgWith the dissolution of the last utopian project of Man in the name of Communism, the great spectre that once haunted Europe and the rest of the ...

    Nothing Is Random: Automason Ver 2.0

    Figure 1.jpg While computers have dramatically changed the way architects design, construction in the US and around the world is still dependent on ...

    Dazzle Topologies

    EVAN copy.jpg One of the great lessons of the 20th Century that our particular generation of architects has inherited is our appreciation of the infra-thin scale: the primal ...

    Transmitting Iconography

    GRAND2cropb copy.jpg Contemporary telecommunication and computer technologies have fundamentally changed the relationship between sign and space, iconography and matter. While Venturi’s model for the decorated shed grew out ...

    Tectonics, Economics and the Reconfiguration of Practice: The Case for Process Change by Digital Means

    sheldon-thumb.jpgThe current programming culture in architecture could all too easily be written off as a youthful, geeky obsession with the algorithmic and the parametric among nascent practitioners, who ...

    Bodies Unfolding

    Bill and Lila Thumb copy.jpgAfter seeing Buckminster Fuller's Dymaxion World Map, a map projected on a flattened isohedron, we began working on the idea of using computer ...

    Cultural Concerns in Computational Architecture

    perkins G. Holmes Perkins, 1904-2004 In September of 2004 I attended two events that reflect on each other. One was the Non-Standard Praxis conference held at MIT. The ...

    Genetic Architecture

    Genetic Architecture With the dissolution of the last utopian project of Man in the name of Communism, thegreat specter that once haunted Europe and the rest of the world has all but ...

    Automason Version 1.0

    Contemporary architects are judged as much by their buildings as they are by the sophistication of the techniques used in design and construction. A certain fascination with technology is natural ...

    Genomic Architecture

    genomic1.gifGenomic architecture is based on the manipulation of the architectural genome. Like its biological counterpart, this genome is universal and encompasses all architecture — past, present and future. ...
    + more articles

Speech Recognition in 1 Kilobyte

Russian scientists at the Institute of Radio Engineering and Electronics have identified a new method of speech recognition that picks up only the essential elements needed to decipher each word. Nuances from whispering, shouting, tremors are ignored, allowing for the efficiency of 1 kilobyte to store all numerals and a few basic commands as a prototype for mobile devices.

via Science Blog


People recognize familiar words without difficulty regardless of the voice and intonation it is pronounced with. “Six” or “eight” remain six and eight for a person no matter how they are pronounced - in a loud voice or in a whisper, in an excited or a calm voice, by the voice of an old man or a child, by that of a man or a woman. Our brains separate the semantic part from the mass of background sound.

As for a machine, each variant of a voice is unique. That is why speech recognition programs usually require training. As a result of training, an enormous library is built up in the memory of the silicon brain, where thousands of possible options of pronunciation of the same words (for example, numerals) are stored. Having heard a word, the computer would look through the library and almost certainly something similar to the heard word will be found in it.

The approach suggested by the scientists from the Institute of Radio Engineering and Electronics, at the Russian Academy of Sciences, is rather more human than machine: a computer under the researchers’ guidance filters individual peculiarities. It picks out the most basic things and rejects all immaterial ones. As a result, the machine even acquires the ability to discern individual sounds and to put together in its “mind” familiar words from these sounds.

As a result, 1 kilobyte would be sufficient for a processor to confidently recognize all numerals and some simple commands, however, pronounced (although only in Russian at the moment). Several dozen people with far-from-ideal articulation - tried to confuse the quick-witted program, by pronouncing numerals either in a whisper or in a voice trembling with excitement. However, the computer successfully rejected emotional frequencies as irrelevant.

“The prototype software interface developed and established by our specialists for the system of data and management commands voice input is intended for mass mobile electronic devices, says the project manager, Vyacheslav Anciperov.

Perhaps, the most important and fundamentally new aspect of this work is that they have managed to single out essential elements of speech being guided by the notion of hierarchical structure of speech. Like in a musical composition, one can recognize more or less high levels of organization - rhythm, main theme, arrangement, so they have also learned to single out the ranges in the speech flow (i.e. in the wide frequency spectrum), which carry the major semantic loading. It has turned out that this is a very small part of human speech sounds - only up to 1 KHz. “Thus we simplified the task for the computer to the maximum. And one more thing - we have taught the computer to recognize individual sounds, which is sometimes far from easy. As a result, our system wins in processing speed and in processor time and memory consumption as compared to those of all known similar systems. This is the path to efficient speech processors that nobody has passed yet,” he adds.

related : [Robotic Animals : Implanting Mice with Speech Recognition][The Scratchpad][Plants + Self-Recognition][Translator Tech][New AI with Cognitive Vision][Morphology database going global][Technology]

One Response to “Speech Recognition in 1 Kilobyte”

    a gravatar Godly Musings » A new, more efficient speech recognition algorithm

    [...] via Future Feeder [...]

Leave a Reply

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>