Sample Vs Frequency Packets.

vicktech · October 12, 2010, 6:47pm

Bad thing about samples is mangling sound and duration when frequency changed by increasing or decreasing playback speed.
Bad thing about instruments - they need many samples to supress sample mangling. And they need volume corrections. Because higher frequencies put more energy out than low ones at same amplitude.
How about doing some RE-SEARCH in sound architecture and try to represent sample as frequency packet.
Sample will become sound processor ready. And despite wave structure will differ from original, even each time, - the sounding will be almost exact.
Frequency-energy compensation can be build-in feature of sound processor routine.

Besides sample will no more be affected by frequency change.
Instrument will obtain completely renewed meaning and become powerful and advanced as it ought be from the very beginning.

(Please no copyright claims, this is for free use)

vicktech · October 13, 2010, 5:38pm

Please vote after reading

Garf · October 13, 2010, 6:30pm

Not sure exactly what you mean, but it sound a lot like time stretching?

rhowaldt · October 13, 2010, 6:51pm

i don’t follow either. please elaborate, or try it with some examples.

vicktech · October 13, 2010, 11:34pm

Well…, instead of direct sampling at fixed sample rate, - frequency spectrum analyzing, and finally leasing ought be performed. Because human’s ear hears sound by using over 40 000 sensors, each responding to its own frequency, (at least I know it from somewhere, if I’m not mistaken).

Then sample will consist of frequency packets and amplitudes, both represented by precise digital arithmetics.

Thus frequency can be changed with no harm to play-time of sample.

Kind of. But more native.

It-Alien · October 14, 2010, 12:01am

as far as I know, the human ear (and not only human one) works more as an amplitude modulator, since each of its parts are pressure sensors, rather than frequency sensors.

your idea, although is quite vague, may work, but would invalidate years of development of audio processing based on fixed frequency sampling, also requiring much more processing power and more complex structures. I’m not saying that it is necessarily worse than the actual way, but you should elaborate it more

I highly doubt this would be true

vicktech · October 14, 2010, 12:16am

May be…
You know better.

I told it before! Sometimes reaction can be like this:

(I’m just kidding )

And it is followed by another idea:
For natural sounding of sample it should be represent as compresssed hologram.
The meaning is: All notes (or even smaller frequency shifts) are played and analyzed, then compressed by algorithm which rids of unnecessary redundancy. According to required frequency hologram will give corresponding frequency packets.
And specific sounds of sample, like noise from touching or scratching strings or beating at them will be transposed accordingly to hologram. So it will be reproduced correctly - no matter what note you play.

Idea is here. It can not be stopped

kazakore · October 14, 2010, 7:16am

I don’t know how many individual sensors/hairs they estimate there to be in the human ear but they are grouped into approximately 30-35 tone sensing areas, hence the introduction of 31 band EQs, supposedly meant to match up roughly with the frequency and spacing. (You may think it quite amazing we can pick up such subtle differences in pitch with so few receptor groups but then think about the fact the average male only has three different colour sensors in the eye for perfect 20:20 colour vision.)

What you are talking about forms the basis of most audio compression. Mp3, ogg etc. Most probably Flac too (before other processing), which Renoise uses. There is an uncompressed/lossless format that definitely uses it but it’s name escapes me.

As far as I know nobody has tried doing processing on the data when it’s in this form, does sound like it should be possible and would quite possibly help with stopping artefacts such as aliasing when changing pitch of played sample (and quite possibly make time stretching easier??)

Interesting idea but a bit much for the small dev team of this wonder software to concentrate their time and energy on!

It-Alien · October 14, 2010, 8:00am

not only that, but I think that it can be said that it is at the base of the whole DSP theory: most of effects act in the frequency domain to which the sample data is converted using Fast Fourier Transform. That’s why I am telling that his idea should be elaborated more: either I am missing the basic point of the “frequency packet” or he is telling to do what FFT already does

rhowaldt · October 14, 2010, 8:22am

this topic is definitely out of my league. which explains why i didn’t understand. you boys have fun, i’m out.

kazakore · October 14, 2010, 8:35am

Doh! Yeah of course. One of a fair few methods mind. IIRC MP3 uses DCT (same as they use for the video in MPEG (1, 2 and basic 4.)

vicktech · October 14, 2010, 4:48pm

I have only idea, technical details you know better than I do.

something like that.

Actually human’s eye has over 300 sensor systems.
For boundary between red and blue, blue and yellow, green and blue, (and so on) it has specialized sensors. To sense increasing of brightness of each colour - used special type of sensor, - for decreasing as well. There are also sensors for angles, vertical or horizontal lines, movement, brightness, and more.

It seems close to truth.

This is exactly what I mean.

kazakore · October 14, 2010, 5:24pm

Evidence? From all my learning on eyes and vision from studying television broadcasting and related subject there are two main categories of sensors, namely the Rods and the Cones.

Rods are the outer sensors, do your peripheral vision, are only black and white (monochrome) but are much better at sensing changes/movement. It is actually a lot easier to see when something moves if you’re not looking directly at it, as anybody who has done any hunting will tell you. They are also a lot more sensitive, which is why you loose colour vision at low light levels.

Cones are the centre sensors and give detail and colour. Classically we are taught there are three type, Red Blue and Green. This is true for the majority of human males and it is the lack of one of these that causes colour blindness, most common version confusing Red and Green as one of those two are missing. It is rare for females to have less than four types of Cones, hence why females are carriers of the colour blindness gene but you don’t not actually find very many women who are colour blind themselves.

Any text book you can find on the subject will agree with these facts (although it may ignore that people do often have more than three types of Cones operating at different wavelengths.)

kickofighto · October 14, 2010, 6:03pm

The human ear & sound processing system (brain) work as both amplitude and frequency sensors. The eardrum and bones of the inner ear amplify sound mechanically. In the cochlea certain hair cells vibrate at their own fundamental frequency triggering neurons to fire (also at the frequency the hair cell vibrates at!). Thus we resolve frequency at the physical and at the higher processing level, in-fact these nerve impulses still occur at the frequency of the sound deep inside the brain where they are processed. You can play a sound in one ear, and place electrodes on the sound processing centres and if you amplify that electrical signal you hear the same sound back!

This idea does have some potential. What length would a frequency packet be in time for it to be imperceptible? If we can already process 32 bit floating point numbers at 96khz. Maybe a spectrum of frequencies at 1khz would be possible but very computationally expensive. Remember that if this is on a computer it still has to be converted to audio and doing that would either require going through the sampling process as we know it or building new hardware to do it.

kickofighto · October 14, 2010, 6:12pm

True but remember of course that our experience of “perfect” colour vision and pitch perception are more expression of our own neurological makeup and physiology than they are real world phenmomena. Colours don’t exist in the real world and if they did there would be far more of them than we can actually see. Our peception of amplitute is pseudo-logarithmic. And our perception of pitch is actually quite poor. A good example of our pitch perception inadequacy is heard in FM synthesis where a reasonably low modulation frequency becomes imperceptible as a vibrato (which it actually is) and sounds like a pitch or timbre / overtone series.

vicktech · October 14, 2010, 6:18pm

It seems we’ve been reading different books, man…

That is what I’m talking about.

Signal goes to the brain being already FFT-like encoded.
So actually what you see does not exist the way you see it.
What you see is hologram memory of your brain brought up every time you perceive visual information…
World is strange, you know…

Just as it has been told.

kazakore · October 14, 2010, 6:46pm

Velocity sensors (think ribbon mic rather than diaphragm) tuned to a particular resonant frequency.

Of course they do. As much as a sound wave and frequency do anyway. And there is much more than we can see, from heat to radio, tv, microwaves x-ray and radiation. All a different frequency of the same thing. The world would be a bit messy if our eyes could pick it all up though, so are targetted at the “visible spectrum.” There are plenty of animals out there that do have infra-red (heat) vision though.

4, 5 maybe 6. Not the 3000+ you were talking about!

vicktech · October 14, 2010, 7:29pm

It was about eye vision. - It is not that obvious and simple as written in school books. Recent researches gave to us many new information. I’m just trying to rely on most recent data.

Rex_Sathum · October 14, 2010, 10:19pm

I think you must have the human eye confused with a compound eye, in which case the answer to this question:

is most definitely yes!

ntx13 · October 14, 2010, 10:41pm

Maybe this link will interest you :

It’s a sofsynth based on the approach you suggested.

I’ve tried it and found it kinda tricky to get good results, but may have missed something…

But, basically speaking, modern pitch shifting algorithms does extract the frequencies of a sound and shift them to change the perceived tone, and it should lead to almost the same results.