I assume it is looking for 5 fingers. I noticed if i folded my thumb in, it did not recognize the gesture so maybe an edge case where people missing fingers can't use this.
Also when I scratched my face briefly it did recognize it... funny.
I would strongly recommend trying to build a web demo somewhere down the line. Even if it requires flash to get the webcam working, I'm sure your conversions would go through the roof if you could show how well it worked instantly on your landing page.
Small note -- the blurring of the image is distracting rather than focusing. Down the line maybe think about a better landing page image.
We hated using the keyboard and mouse for controlling music in the background or watching movies on our computers from a few feet away. So we decided to solve this problem by using hand gestures to control music and videos via the built-in webcam in your computer.
Please download and try it. Do let us know what you think in your comments below or by emailing us at feedback [at] flutter.io.
after typing in my email and name i got this:
The link that you have given is not valid. Please verify that you have copied the link completely. This incident will be reported.
I hardly ever get excited after downloading something, maybe a, wow that's cool, but when I installed this (which was very easy) and it just worked, I actually clapped and said 'this is fucking awesome.'
I had the exact same feeling. You see these gesture controls in movies and demos and they seem great, but it's truly exciting when you get to use it yourself.
Thought that this was great - how about a next/previous track gesture, and being able to change the frequency that it checks for the hand? (it looks like it's time based)
I believe one of the co-founders is Navneet Dalal, who, along with Bill Triggs, developed the "Histograms of Oriented Gradients" (HOG) features that are widely used across all of computer vision†. Thus, I'm guessing that there's some serious vision in use for this app, which is pretty awesome!
(My apologies to the other co-founders, who are likely just as talented!)
However, music control isn't a pain point for me. What I would pay for is a start/pause gesture for Netflix/Hulu/Amazon Video. Worst thing about TV on a computer.
My friend and I looked into controlling those apps for a program we made to control media players with your voice called Shoevox - http://shoevox.org
The hard part was that we used sending keyboard shortcuts as our method of sending commands (which works for programs we support like WMP or VLC), but we couldn't do that reliably to Flash. We thought about switching to UI Automation but haven't done so yet.
I just ask my very non-technical girlfriend to come see this new app I downloaded. I showed her flutter.io working in action. She shrieked and said, "that's awesome, how do I get it?". That's a very good sign for you guys...nice job.
Though I know there's a chance you'd not be willing to share the information, I'd love to learn a little bit about how you built this product.
It appears as though you're using ML to recognize the gestures—anything else you could share would be fascinating. I'll be sending in a job application, as well!
You're right. We use image processing, computer vision & machine learning to detect Flutters. Our entire tech stack is custom written as we are solving some quite challenging & difficult AI problems.
Interesting/cool project, though I do wonder how you plan on keeping it simple when you expand and add additional actions?
The UX of controlling media using the Xbox Kinect leaves a lot to be desired - it's cumbersome.
I've had my mac mini connected to my tv for awhile and am very interested in the Internet TV space in re: what will become the standard way we control such technology? Will it be thru gestures, airplay on iphone/ipad type technology, voice driven or a mix of all?
For many years I've been using a wireless gyration mouse/keyboard (demo of Internet TV setup http://www.youtube.com/watch?v=l-IryK4wsUM) and recently I started to use Chrome's voice search to quickly bring things up especially if my wireless keyboard isn't reachable.
After seeing this and my years of enjoying the Internet on my TV I'm thinking a few short gestures like this app uses and mix of voice driven actions might provide the best UX.
Based on the views and comments of video I link to above people want the Internet on their TV and all it provides. We just haven't figured out the best UX for it.
Paul - that is a great observation and feedback. We believe that future will be about us using touch, voice, and gesture all three depending on context and circumstances. We do plan to keep it simple, and we have some great ideas for it.
"The music swirled and dived for a moment. Another voice broke in, presumably Halfrunt. He said "Vell, Zapbod's just zis guy, you know?" but got no further because an electric pencil flew across the cabin and through the radio's on/off-sensitive airspace. Zaphod turned and glared at Trillian -- she had thrown the pencil. "
This is very cool. It worked perfectly even in the dimly lit room I am sitting in.
What I've been wanting and have thought about developing myself is a universal play/pause control that pauses whatever you happen to be listening to at the moment, whether it be a desktop client like iTunes, Spotify, Rdio, etc, or if it is something running in a web browser like Pandora, Soundcloud, Youtube, etc.
This app takes a step towards this in the way it handles iTunes and Spotify at the same time, by controlling whichever app had focus last.
I imagine for web browser control a browser extension would necessary. And ideally a plugin architecture would exist to allow 3rd parties to add support for additional apps.
A different but functionally similar idea would be an automatic pausing of all other services when you start playing one (i.e. when I start playing a Youtube video, automatically pause iTunes).
There needs to be a way to override those behaviours though, as there are situations where the audio of a YouTube video (for example) isn't very interesting, and you'd rather listen to music. Lack of background-playing in the iOS YouTube-app is still an annoyance to me.
Really cool. I was just playing with it and my girlfriend walked over to ask me a question. I held up my hand to my computer as if to say "Hold on, computer."
It was a bit surreal. Very cool. Can't wait for more gestures or a generalized way to program my own gestures. I would love to be able to gesture "Switch to iTerm.app" by doing an OK sign or something. Sign language letters maybe?
Anyway, thanks. Really enjoy it already, can't wait for more. Also, I would pay money for this. If it had all of the play/pause/forward/back functionality I think I'd give you 4.99 if it was accurate. Just sayin'.
My thoughts almost exactly on the programmable front... like Better Touch Tool but rather No touch Tool, although the latter name does retain the locker-room hilarity of the former.
Perhaps a gesture to bring up the app switching ribbon and a gesture to reference the item (numbered from left to right maybe.)
I'd also like to have the option to calibrate it, my eyes are painfully photosensitive and I tend to work in very dimly-lit conditions.
What's nice about this is you don't need to switch screens to pause the music if you find yourself writing/coding and need some quiet when you're in the zone. Granted you can do this with the mac play/pause physical buttons but that would require your eyes to look down and pause your thought process.
This happened to me just now after I installed Flutter, and it seemed to me that physical gestures didn't interrupt my higher brain functions. Of course when I noticed this it made me interrupt my work and come to HN to leave this comment. The irony.
Glad you noticed it. We want you to never think about controlling your music at all. This is one of the things I love too... we listen to music in a background, so controlling it should be in the background too.
That's what I like most about it too. Love not having to switch screens to stop the music when a call comes in. One thing: I didn't find the installation super intuitive. Probably some room to improve that experience
By the way I see great application in this for education, particularly with smaller kids
Flutter seems to keep the camera on 100% of the time which prevents apps like Skype from using the camera for video chats. I'm not sure if there is a good workaround for that?
Andrew - great observation. On top of your menubar, there's Flutter icon. Click on it, and it will give you simple flutter on and flutter off capability.
There is another, more fully-featured app called FluidTunes which I used quite a while ago. Not sure if it still works on the latest version of iTunes: http://majicjungle.com/fluidtunes.html
This is awesome, but one question/suggestion: when a user is going through the "How to Use Flutter" tutorial, do the gestures work then, or only once the tutorial is closed? I kept trying to do it, but there wasn't a response until I closed the tutorial.
Only when tutorial is done and you're using iTunes or Spotify. First time you bring up either of the apps, it will notify you through top menu bar that Flutter is controlling it now.
Once that happens, you can control in the background or even if it is minimized.
Very cool although with CPU usage at about 30% on my Air I won't be able to use it much. If you can figure out a way to drop average CPU usage down to reduce heat & power usage that would be a big deal.
Hi Hedgehog - thats a great point. We thought of it, and when you're not doing a gesture, CPU usage should drop to around 2-3%. Let us know if it doesn't. In our test, average CPU usage was around 5%.
Interesting. It does seem to drop to about 5% if I'm not in front using the computer and not in front of it but otherwise floats between about 7% and 20%. This is on a MacBookAir4,2, based on a few minutes of testing it seems like it's about 65% additional power consumption when the machine is otherwise mostly idle. I'm actually impressed though that you got power consumption as low as it is. Photo Booth consumes about 6.5% so I'd guess that's the lower limit for Flutter, maybe default to disabling Flutter while the computer is on battery?
Few questions: Aren't you entering a field which has patents covering gesture controls on media exactly like this?
A comparative advantage of mice and keyboards is that a minor amount of energy, (but large amounts of co-ordination) are leveraged into larger changes.
Wont this limit the number of the gesture controls you can incorporate?
(OTOH - if you had enough fidelity, you would just need a dummy keyboard. You would just have to map finger movements with key presses. you could just create a webcam of your own as well.)
Soooooo. Doesn't this open up a whole new realm of privacy concerns? Who cares if Path has my contacts list, I'm more concerned that you guys are watching me surf for porn!
Legit concern. This is why we've built an app w/o a server component, so everything happens on your machine. We do not capture anything from your webcam. It all happens on your machine. On top of it, there's a turn off capability which will allow you to turn it off for time being.
Hi guys. Since there seems to be an ML component, I was wondering how this was built. Do you start with a basic hypothesis and see if it holds for some training set and then refine it or right off the bat go with a generic classifier like SVM (with some features in mind) or something like that and then tweak it?
Also, how did you choose the dataset for this ?
This project is very cool. Congratulations and good luck!
I was blown away when I saw a demo of Flutter a couple of months ago. Their Spotify and iTunes integration only scratches the surface of all the gestures that they supported in that demo. I can't wait for them to add some additional gestures, such as skipping to the next track, and PC support so that I can get rid of my keyboard with the dedicated music player controls.
This must be amazing app to download as it is clear from the feedback received. Please don't forget that we are a large enough community who uses windows. Especially in India. We are already excited even before you have our version. the kind of capabilities it gives, would be amazing to download. Congrats in advance to the team.
Fantastic app, does a fantastic job of recognizing my palm - unfortunately, it seems to also really like my face. After the tenth or so time that it had paused my music without me raising a hand, I realized that every time I turn my right cheek to the camera, Flutter pauses my music. Which is a little troublesome :)
This is fantastic - absolutely the most satisfying method of interaction with an app that I've experienced in a long time. Expand the gesture set to include skip forward/back and it's a winning consumer app. Personally, I'd love (and would pay for) a library allowing me to add gesture control to my own apps.
I did a similar hand gesture recognition software to control image viewer as part of my computer vision course project. It was really fun to do. I believe, presentations and lectures are the places where this kind of technology really makes a lot of difference. Good luck with your startup!
How far are you guys from full Minority Report-ization of my Mac?
I mentioned in another post that I tend to work in very dimly lit spaces due to painfully photosensitive eyes. A non-mandatory calibration dialogue would probably be a handy thing for folks like me.
I saw Samsung demoing some similar technology during SXSW (using a kinect that was hooked up to a mac mini and being displayed on one of their LED tv's), and it didn't work anywhere near as well. Congrats.
Works great once I realized that I couldn't be so close to the camera, but does anybody have any tips on disabling the green iSight LED on my macbook pro (short of covering it)?
Supposedly it's hardwired, you can't power the camera without powering the LED. I wrote a security app that grabs from the iSight every few seconds, and got tired of the light flashing on and off all the time, so I just covered it with a little black sticker.
Quick note. Your site assumes I am using Chrome - FF doesn't save downloads in the bottom left hand corner. Easy for me to figure out, impossible for someone like my mom.
Hey guys this is some really cool stuff. If you can integrate this with browser gestures to scroll, refresh, and move back and forward on pages. This would be awesome.
I so hope this is an art project and in a few days we'll have a website of people making funny gestures to the camera to pause and play Justin Bieber songs ;)
In case anyone was wondering, I can confirm "mirror stops" work: try facing your laptop towards a mirror and doing your gestures from behind the computer- wow!
This works incredibly well. I wasn't sure how accurate this could be without the kinect-style depth sensor but it's surprisingly accurate. Great work guys!
Hi, we are not limiting it to that. We want to add more to the list. It just takes time... we're still a small 3 people team. Know any great engineers/designers who can help us execute faster? :)
issue is that even when i am not controlling the playlist and randomly moving my hand it recognizes it as gesture... seems like lot needs to be done to differentiate between noise and actual gesture. anyways nice try.
Hi. Can you please try again? It just worked for me? If you still get an error then please send email to Feedback [at] flutter.io an we will send you diff link. Thanks so much. M
Surveillance in the name of convenience has gone too far. I can see a future where companies offer services as wrappers around visually analyzing people's behavior in their own homes. They will sell that data just as they do now. That's not a future I encourage.
I assume it is looking for 5 fingers. I noticed if i folded my thumb in, it did not recognize the gesture so maybe an edge case where people missing fingers can't use this.
Also when I scratched my face briefly it did recognize it... funny.
I would strongly recommend trying to build a web demo somewhere down the line. Even if it requires flash to get the webcam working, I'm sure your conversions would go through the roof if you could show how well it worked instantly on your landing page.
Small note -- the blurring of the image is distracting rather than focusing. Down the line maybe think about a better landing page image.