Hacker News new | past | comments | ask | show | jobs | submit login
Launch HN: PingPong (YC W21) – Video messaging for remote teams
90 points by jeffwhitlock on March 7, 2021 | hide | past | favorite | 68 comments
Hi HN,

I'm Jeff, co-founder and CEO of PingPong (https://getpingpong.com), where we help remote teams collaborate and stay connected by exchanging video, voice, and screen recordings asynchronously. Think of it as Marco Polo or Snapchat (sans ephemerality) for globally-distributed product teams.

We got the idea from our own experience on a different startup. Murphy, the co-founder and CTO, lives in Nigeria, and I live in the US (Utah). We had five other team members spread across three countries and four time zones. The problem for me as a team leader was how to share my ideas, feelings, and updates in a way that felt authentic and conveyed energy. I felt like I was spending my entire life either typing in Slack or scheduling Zoom calls at terrible hours. I had been kicking around the idea for our product in my head for months. At some point, I began thinking more about how to improve communication for distributed teams than what I was working on. In February 2020, after my second child was born, Murphy and I planned to slowly build a solution to these problems on the side. Then, the COVID situation exploded, and we knew it was time to go all-in. We pivoted our focus entirely. When we first started, our name was Girbil. The first version of the site is laughable: https://www.girbil.com/.

In synchronous work cultures, people expect contextual understanding in their interactions because it's assumed that everyone absorbs that context by osmosis in meetings, informal chats, and channels. Workers are used to having large portions of their day plugged into a flow of just-in-time information like a network. With asynchronous cultures, it's better to assume little or no prior contextual knowledge in every communication. Context is given by referring to documentation or by laying it out specifically. Workers are used to having large swaths of uninterrupted time to work deeply. Because we lived across continents, most of our work and communication had to be asynchronous.

We felt that current chat tools (e.g., Slack, Teams) are built for synchronous teams. You can see this in many of these products’ design decisions. For example, pressing enter sends a message instead of line-breaking, encouraging short-form messages that provide minimal context. Furthermore, the lack of a message workflow encourages a constant stream of low-value messages. By default, instant notifications are sent for every short message—even when the recipient is in do-not-disturb, the sender has the power to override. All these design decisions promote a constant flow of shallow, quickly-scanned information demanding immediate responses.

We want to build a communication product that better meets the needs of distributed product teams like ours—designed first for asynchronous teams across multiple time zones where it can be hard to get face time. We decided to start with video. We felt the medium itself, though not perfect, addresses a lot of the issues we experienced. With video, you tend to record only when you have something important to say. Most creators want to "sound good," so they put thought into what they're saying. Additionally, the listener can't "skim" the messages, and they receive a richer message in terms of intent, tone, and energy. Sharing an asset while screen recording adds another layer of depth and efficiency. But we're still trying to figure out how to maximize the above strengths of async video while mitigating its downsides (e.g., its linear nature). For example, we've limited face and voice recordings to two minutes to encourage succinctness (though screen-share recordings can be 10 minutes). We'd love to hear your ideas here. We've also made some design decisions to help teams focus. For example, instead of channels, we've built conversations. These are designed to be started with a specific goal and are very easy to leave, close, and end. Users can have multiple workspaces, but they’re hidden instead of ever-present.

We hope to ultimately build a Slack/Teams alternative designed for rich, asynchronous human interactions that encourages deep work. We still have a lot to build before we achieve this goal, and we'll need to incorporate text and file attachments at some point. Today, we use PingPong for 50-75% of our team communication. When we have to send a file or structured text, we use Slack. We love using our own product to collaborate as we work on it together.

We'd love to hear your thoughts, feedback, and ideas—particularly if you'd like to share any limitations or frustrations you experience with Slack or Teams as we did. Thanks for reading!




> We hope to ultimately build a Slack/Teams alternative designed for rich, asynchronous human interactions that encourage deep work.

All I can say is keep going. I've been feeling the absence of this for a while, and have taken to making occasional videos on Loom to supplement live meetings and Slack messages. But it's a disjointed experience, and Loom is not trying to be a collaboration tool at all. It would be great to have a solution that does the whole thing end-to-end, with a focus on being more async-friendly than Slack.

By the way, I downloaded the PingPong app, and it seems great. Are you selling subscriptions, or just putting the free version out there for now?

P.S. If you haven't read this article on Figma's growth strategy, I'd take a look. There are some parallels with what you're doing -- i.e. SaaS, collaboration-focused, and attempting to dislodge incumbents with a similar list of features, but less focus on collaboration pain points. https://kwokchain.com/2020/06/19/why-figma-wins/


> Loom is not trying to be a collaboration tool at all

ngould I'd love to understand this point of view. What are the pain points of the experience being disjointed between Loom and Slack for your team right now?


Thanks for the encouragement and article on Figma.

> By the way, I downloaded the PingPong app, and it seems great. Are you selling subscriptions, or just putting the free version out there for now?

No paid subscription for now; we're optimizing for usage and feedback. We should always have a free plan and introduce new features into a paid tier in the long run.


Any expectation on pricing? Would be nice to know what your prospective model looks like before diving in.


We don't have definitive plans at this point, but it will likely be a few dollars per user per month. But what's currently in the product will probably be free indefinitely.

We hope to encourage people to pay with new, great features. I'm against moving features we've already given our users for free behind a paid plan.


Sounds great!


Hey, congrats on your launch. The first thing that comes to my mind is I hate recording myself. Even on zoom meetings I try to turn the video off as much as possible. It also seems easier to me to share a thought or suggestion over chat when there is no expectation of instant reply given writing something down forces you to filter your thoughts and specify the concrete details. How do you show the utility of self-recording to such people?


Out of curiosity, is this true for voice and video or just video? We added voice recordings as a feature for people who don't want to record themselves on camera. We're also adding text as a "type" of message you can send.

One of the interesting things we've noticed is that recording yourself seems to encourage more intentionality in what is being said than a quick chat or SMS (maybe because you're more self-aware while recording?).

As you mentioned, some people are very intentional while writing, but I'm not sure that's the norm.

Many of the chat tools' design decisions encourage stream-of-conscious writing (e.g., press enter to send instead of line-breaking, one-line text boxes).


I think this intentionality comes from the person more than the tool; like, as obvious as it is to you that chat causes people to dump a stream of consciousness while audio forces people to be "intentional" and think about what they are saying, it is frankly just as obvious to me that when people are speaking they tend to "ramble" as they attempt to somehow construct a thought on the fly (and often end up repeating things they have already said) whereas with text people construct and edit well-organized and considered statements.


Very fair point. Being recorded is an interesting added dynamic vs. talking live, but you still could very well be right. It is for sure idiosyncratic.


Sometimes you don't have to share your face... just the notepad and pencil, and you can talk as you write.


Hello everyone co-founder and CTO at Loom. There are lots of conversations around owning the end-to-end communication loop with Loom (and other async video products). Our team went over a ton of use cases we use Loom for that are pretty relevant to this thread in a webinar I'm linking to below.

https://www.loom.com/share/fe062f809754416b9d583266b47b9303

We will continue building out our platform, but we are instead focusing on features that generate a ton of value and are very hard to do (transcription for every paid user, instant trimming/editing, etc.). If there are other suggestions/gripes/comments about Loom, I'd love to hear about them! If you don't have a Hacker News account but do have a Twitter account, my DMs are open(https://twitter.com/vhmth).

Other than that, I want to say hats off to the PingPong team and other teams getting into this space!


Thanks for the encouragement!

That focus for Loom sounds on-point. As you said, those are hard to do and extremely valuable, which further differentiates.

Truthfully, I’m honored that you took the time to comment. I am a big fan of Loom and have been for some time. We pay for and use (and plan to forever use) Loom for some use cases at PingPong (e.g., videos on our product wiki, support pages, reporting bugs in our own product).


Thank you so much for recording with us Jeff!



Helpful (http://helpful.com) was working on this as well but got acquired (or maybe acquihired?) by Shopify. I'm not sure how much traction they got or if the product was just before its time. I suspect there are a bunch of lessons to learn from their efforts.


It’s definitely a timely idea with the remote work zeitgeist. I saw those launches and wish them the best.

That said, we’re much more focused on video and screen recording, whereas these are focused on voice.


Congrats on the launch! I work for a video-chat startup as well, and although we’re focused on synchronous rather than asynchronous, maybe my perspective will be useful.

In the “Woice” launch-thread, I noted that the power of voice-communication is in its real-time feedback loop that is like a collaborative search algorithm for both identifying and solving problems. My concern is how to keep that algorithm operational while losing the efficiency of real-time feedback.

I think the answer might come from my latest Slack irritant — that threads are only one-level deep. A deep conversation is just as difficult to hold over Slack as it is in real life, as it forces conversations to be linear and single-threaded. Due to that, it is common to “lose the thread” of discussion.

With async communication, it is possible to have a multi-threaded non-linear conversation. While losing the efficiency of being real-time, nested asynchronous communication could enable a broader search space.

A few commenters have mentioned Loom. But imagine if you could reply to a timestamp in Loom with your own video rather than an emoji. In a normal face-to-face conversation, you would interrupt someone to make a comment or question, and this would often change the course of the conversation.

Although face-to-face conversation is on the surface single-threaded and linear, I think good conversations actually hide the non-linear threads in people’s minds as the discussion alternatives between meandering and refocusing, enabling a wide-ranging conversation where many topics are examined and ultimately synthesized. The original thread is never lost even as other threads of relevance come and go.

So that’s my two-cents and I hope it’s helpful: to keep in mind that synchronous communication is only apparently single-threaded and linear, and that when moving to asynchronous you might want to explore how to retain (if not amplify) multi-threaded non-linear communication.


Your perspective is helpful; thanks for sharing. That’s a good point about async opening up the possibility of multi-layer threads. We’ll consider that! We’d need to figure out how to do it in a way to help people not get lost. I’ll also think more about if there’s a way to somehow recapture the efficiency from real-time feedback in synchronous voice communication.


I had actually bookmarked a service years ago that sounds (from the comments) similar: Talkshow.

https://news.ycombinator.com/item?id=19561585

(I accidentally clicked the bookmark today and was confused if it was this thread ;P.)

This company might have had some relationship to the Clubhouse team?

https://twitter.com/semil/status/1254756645059489792?s=21


I had a hard time finding Talkshow - looks like they might have shut down (I could be wrong). But there are some relevant comments in that thread; thanks for pointing me here!


Sup Jeff! Glad to see you on HN! Been using pingpong almost every week since installing it! I am sharing my feedback here but I already told you about it all:

- lenght of video, needs more options - a way to see reactions to my vids faster and more concisely, maybe a timeline view - better UI/UX for threaded back and forth pingpongs - that camera bug on desktop where my face is super zoomed in

Boom! Love it!


Thanks for giving it a try, and for the feedback. Keep it coming!


I do think you're on the right path. There's a list of things as a manager I appreciate about pingpong's potential:

1. More personable 2. Ability to check on remote employees indirectly 3. Seems like it can lead to more meaningful company communication 4. I don't feel I need to organize a video chat to convey things that may be harder to convey via text. It's nice to have a quick option to record and send.

These are the items that are preventing us from implementation into our daily. We have a small team of 8 people (3 designers, 1 customer service, 3 operations, 1 marketing)

1. We'd rather have a one stop shop for our communication (ie. ability to send files & code corresponding with the video) 2. Currently just resorting to a facetime call or slack video call

Regardless - I'm still going to push my team to use this and get some more feedback as it seems your team is using it more frequently than slack.


Thanks for the great feedback. This is a focus for us to figure out.


I really like this concept. I think for me one of the things that kills me about Slack is the way people write tiny little stream of consciousness messages. These are way less thought through even than they would be if you were having a normal conversation and serve as a distraction when it comes to the constant notifications.

I think asking people to stop and think a little bit (which I believe recording a video does) is great. I'll take one thoughtful video over 50 incoherent messages any day of the week.

Also thinking about trying this for a daily sync/standup since we often hear that people are not really listening to their teammates during these but instead are thinking of what they're going to say instead, which is of course entirely self defeating. Taking a minute to record your own video and then being able to consume your teams videos in an non-distracted way sounds spot on.


Thanks for the feedback. One of our team's best productivity hacks is using PingPong for standups and sharing our screen to show whatever it is we've been working on. It's incredible for team understanding and engagement. It's like a mini-async sprint demo every day.


Congrats on the launch! It definitely seems like it is a deficiency within the current suite of communication tools. I saw you mentioned in your post that you use Ping Pong and slack, and that ping pong will eventually have text and file sharing. Just curious, what are scenarios when you need to use structured text? Is it more for documentation purposes e.g. lists, tasks, etc.? I see the value in both, but it definitely makes sense to lead with video to prove out that value. I almost feel like 50-75% verbal communication better mimics an in-person office setting. But still gives you the opportunity to plan out what you will say before sending it.


You've come to the same conclusions I have about video that I'm using for my next startup (and YC application), though we're using it for a different purpose. We floated the same name as well, so I'm glad we didn't pick it!


That's a funny coincidence. Excited to see what you build! Let me know if I can help.


Sure thing. E-mail is in my profile (you don't have one in yours).


I founded and built a company based on a threaded, asynchronous multimedia forum system about 12 years ago. It supported any type of content that was 'V.I.T.A.L' : Video, Images, Text, Audio and Links.

(I sadly had to resign from it, after getting royally screwed over by the 'investor relations guys' who were on the 'pump and dump' of our OTA stock, after we went public via a reverse merger.)

I have been thinking of building something like it again recently, using everything I've learned since - technologically and financially.

Best of luck! :)


Dang, sounds like an unfortunate experience. Now is a good time to do it! Happy to compare notes.


The whole last year has definitely been 'the right time' for something like it again. And especially across timezones.

These days, I would probably only offer it as a white-label/SaaS model rather than an 'open' social network. Possibly something private/invite-only.

The past few years have made me think that I'm extremely glad to have not been involved at all with it any more. I'd had some extremely lucrative offers from political parties for them to use one of its features, which I'm glad I turned down. The effect it would have had on the incumbent beta-testers at that stage would have been catastrophic.

I would also stick to 'organic growth' rather than outside investors, who could absolutely destroy your vision for the product, for their own ends, without a care in the world for the users and community. Lesson learned the hard way.


Any plans for supporting Linux? Half of our team (30 ppl) is Linux-based and I think it's often the case for software engineering teams.


Yes, we plan on supporting Linux soon. We built our app using Electron, so the lift shouldn't be too heavy to get it working for Linux.

I'd love to have your team beta our Linux app when it's ready. Email me if interested. My email is in my profile.


Could be an interesting proposition for classroom use—some of my students juggle babysitting while they are in my class and their grades suffer. State require synchronous instruction time, leaving an inequitable gap for students whose parents work during the day. Will you provide premium access to educators?


Great question. We're focused on helping remote product teams, but I'd be happy to give premium access to educators—if they think it will help!


Love the idea of more asynchronous collaboration. Helps preserve valuable engineering flow time while still allowing for thorough feedback from stakeholders. Are there ways to know when someone has viewed my video? Excited to try this out with my team.


In what contexts and use cases do you see lo-fi video outweighing text or the traditional mediums most? I can see some use cases where this would definitely make since. As some of the other comments mention, I imagine getting people to initially using this new medium is key.


Good Question. With video, anything that carries emotional content or where tone/intent can be misconstrued (e.g., gratitude, announcements, feedback, following up).

With screen recording, we've learned that so many messages can be made more efficient by showing an asset while you talk (e.g., documents, presentations, designs). We even will show our IDEs while doing standups over PingPong.


I’ve been on a few teams that used to use slack and loom to achieve a similar effect. One benefit of loom though was the ability to have our internal communication and demos to customers in the same space. Any plans to offer sending a video to someone not in your workspace?


Currently, you can copy links and view the videos on the web. You can't organize the videos in folders like Loom, though. Our goal is to facilitate a far more seamless back-and-forth collaboration.


I encounter creative folks who really don't like Slack's notification system, and hate how the workplace has become a bunch of impersonal pings.

Can you talk a bit about the tech stack? How is your video handled and are you using any services worth noting?

One thing that Slack is useful for is an accumulation of a searchable knowledge base. Do you have any plans to make the videos searchable and/or are closed threads/conversations deleted to the end user?

Overall - this makes so much sense. At the beginning of the Covid lockdowns, my wife started regularly using MarcoPolo to keep in touch with her friends. She loved the semi-serendipitous messages and intimacy exchanged. This is definitely missing in the workplace.

Congrats on launching.


Thanks for the questions.

For desktop we've built an electron app with React and Node. Our mobile apps are built using Flutter.

We use WebRTC for the camera stream and web sockets to send the video stream to an EC2 instance. All data is stored in Amazon S3.

Your knowledge base point is a good one. We're working on searchable video transcripts and a tagging feature to make it easier to organize and retrieve the information.


Love this idea. I think the depersonalization of slack, the difficulty to organize of Snapchat, and the cumbersome nature of zoom needs to find a good spot. Hopefully this is it.


This is clever; it’s pulling a common behavior (audio messages on whatsapp) and pulling it into work culture. Slack followed this strategy pretty well.


Exactly. Many people prefer text messaging to phone calls because it’s faster and integrates with the rest of your life.

Slack brought SMS to the workplace.

Consumers use Snapchat, WhatsApp, and Marco Polo for video/voice recording, but no one has cracked this for collaboration in work cultures.


Looks nice we want to try it out. We cannot find the slack integration though, is it available yet?


Yes, our Slack integration will be in a beta for a few more weeks. Email me if you'd like access. My email is in my profile.


Lots of naming options and this one is one of the worst IMO. To much history of what ping pong is already and because it is an active thing it as opposed to stripe which is a static not activity thing. Worth changing ASAP


Thanks for the feedback. Naming is hard!

Is the concern that it may be too easily confused with the game? I'd love to understand your feedback a little better.

We felt that PingPong gives a pretty clear picture of sending something back and forth.


I saw ”PingPong” and ”video messaging for teams” in the title and immediately got idea what this is likely about (and it turned out to be correct).


Is it end to end encrypted? It sounds like something I'd like to use, but if it's not private from the service operator and their hosting providers I probably can't use it for most of my projects.


Good question/feedback. The app is not currently end-to-end encrypted.

We've looked into doing this but haven't found a way to do it while supporting transcription.

We're currently planning on implementing single-use keys to balance security with transcription needs.

Would love to hear your thoughts on this.


Is transcription really a critical/killer feature? Does the product fall down without it? Can it be done clientside and then the transcribed data be encrypted e2e alongside the av streams?

For me if it's not e2e it's a nonstarter, which means I'm not using the transcription feature because I'm not using the product at all.


For some transcription seems to be critical (see the search-ability knowledge comment above), but e2e is a deal-break for others. I totally understand where you're coming from too.

Perhaps we'll figure out a client-side solution or create ab e2e channel setting where transcription is disabled.

Really appreciate the push to figure this out.


Sounds like a great idea, will test it in a bit


Looks like you forgot to configure Google auth origin


Thanks again for pointing this issue out. It's fixed now. Would sincerely appreciate it if you can confirm that it works for you.


Thanks for pointing this out.

Did you happen to do this in incognito or different privacy mode?

We have a bug where Google auth does not work in these modes.


I'm not the original poster, but I also had the same issue. I'm not in an incognito window nor do I have any special privacy mode (I'm using Chrome on OSX).

Error 400: redirect_uri_mismatch The JavaScript origin in the request, https://www.getpingpong.com, does not match the ones authorized for the OAuth client. Visit https://console.developers.google.com/apis/credentials/oauth... to update the authorized JavaScript origins.


Thank you so much for reporting this. Figured out what's causing the the issue and are fixing it now.


[flagged]


"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."

https://news.ycombinator.com/newsguidelines.html


I'm sorry if I sounded dismissive, I was just impressed by the way the text sounded to Me, as I read it, and figured people might be interested in the data that I collected as I thought made said text sound weird to Me. I have no idea why My comments would not be considered a good critical comment to you the same way I think it looks like to Me. Get My point?

(I never meant to be shallowly dismissive but rather provide good criticism and lament if it sounded otherwise)


I don't really get your point, but it may be relevant that when I edit launch texts for YC startups [1], I tell founders not to refer to themselves by company name. As in: don't say "FooCo lets you bar your baz"—that's a trope of marketing language. Put it some other way, such as "We let you bar your baz". Perhaps that advice is pushing things too far towards "we"? I don't know; I didn't notice it yet, but there must be some reason it triggered an adverse reaction.

[1] and non-YC startups! anyone is welcome to ask for help with this at hn@ycombinator.com, as long as you remember that the worst-case latency for response times may be astonishingly terrible. You can see the advice we give to YC startups here: https://news.ycombinator.com/yli.html. The logistical bits don't apply to non-YC cos, but the communication bits emphatically do.


Substituting "we" for the company name would probably not help at all. The text simply is too personal. This is particularly bad for a company whose goal is likely to provide value with absolute minimal friction—the potential user is primarily looking for something to improve their lives and wants to know what can be done for them. I urge you to highlight all the occurrences of "we" in the text and see how many phrases are meaningful in that regard, and for the ones that are, how most can be simplified by being rewritten as impersonal.

I see Dale Carnegie getting a lot of flak these days but this is one point he illustrates incredibly well in his most famous book in my opinion.


I think you may just have a different stylistic preference from what I understand to work well on HN. I tell the founders explicitly to include themselves personally and include the backstory of how they came to work on this.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: