@saaaam, as a tech leader working in the publishing industry, I'd be interested in connecting with you to hear more about what you work on. My email is my username @lp0.org. Maybe there are opportunities for us to collaborate!
As someone who has been down this road many times before, I can't stress this enough: DDoS mitigation services don't solve the problem of an app-specific layer-7 attack, so it's important to test how well your mitigation service actually responds (it isn't a silver bullet). You also need to make sure your team has tested, proven procedures for engaging the service, responding to attacks, etc. Services like NimbusDDoS (www.nimbusddos.com) are good because you can run realistic scenario tests and make sure your team and infrastructure are prepared. There are other services out there that I'm less familiar with, but either way it's a really worthwhile exercise.
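To make the "scenario testing" point a little more concrete, here's a minimal sketch (Python, stdlib only) of the kind of application-layer traffic generator a drill might use. The URL, worker count, and duration are hypothetical, and this should only ever point at a staging endpoint you own, during a scheduled exercise your mitigation provider knows about; the goal isn't to take anything down, it's to see how the mitigation layer and your runbooks actually respond.

    import concurrent.futures
    import time
    import urllib.request
    import urllib.error

    # Hypothetical staging endpoint -- only ever point this at
    # infrastructure you own, during a pre-announced drill.
    TARGET = "https://staging.example.com/search?q=expensive-query"
    WORKERS = 20           # concurrent clients
    DURATION_SECONDS = 60  # how long the drill runs

    def hit(url):
        """Issue one request and classify the outcome."""
        try:
            with urllib.request.urlopen(url, timeout=5) as resp:
                return resp.status
        except urllib.error.HTTPError as e:
            return e.code      # e.g. 403/429 once mitigation kicks in
        except Exception:
            return "error"     # timeouts, resets, etc.

    def drill():
        results = {}
        deadline = time.time() + DURATION_SECONDS
        with concurrent.futures.ThreadPoolExecutor(max_workers=WORKERS) as pool:
            while time.time() < deadline:
                futures = [pool.submit(hit, TARGET) for _ in range(WORKERS)]
                for f in concurrent.futures.as_completed(futures):
                    outcome = f.result()
                    results[outcome] = results.get(outcome, 0) + 1
        # The interesting output: did the mitigation service start
        # blocking/challenging, and how long did that take?
        print(results)

    if __name__ == "__main__":
        drill()

The numbers matter less than the process: did the on-call person know how to engage the mitigation service, and how long did it take before requests started getting blocked or challenged?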
We just use RT [1] and TWiki [2]. Changes come in as tickets in RT and we discuss them at change management meetings (which we document in the TWiki). We document everything in the TWiki. If someone comes and asks for our change management policy, we point them at the TWiki, which talks about Puppet, and then we can show them the change management minutes, etc. We have a light process and it seems to work.
A case management system and a wiki have typically been how I have done this. It can be a little tough, though, because these tools aren't necessarily built for this type of workflow. Perhaps RT does a better job than some of the other options, which really want to be support ticketing systems or bug trackers rather than change management systems.
Yeah, we only have 4 people in ops and about 20 in development. We have a daily standup where the ops guys and 1 person from dev meet (5 people total). We discuss what happened in the past 24 hours and what's coming up in the next 24 -- this takes about 10 minutes. The process is super light.
We also use Puppet with git. This lets us version everything that goes into production, since every change goes out as a Puppet tweak. It's great for rolling back changes or getting an idea of what was deployed. Like I said, read that Visible Ops Handbook.
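As a rough illustration of what that buys you (a sketch only -- it assumes your Puppet manifests live in a git repo at a hypothetical path like /etc/puppet and that deploys are just commits to that repo), "what was deployed" and "roll it back" become plain git operations, here wrapped in a little Python helper:

    import subprocess

    # Hypothetical location of the git-managed Puppet manifests.
    PUPPET_REPO = "/etc/puppet"

    def git(*args):
        """Run a git command inside the Puppet repo and return its output."""
        return subprocess.run(
            ["git", "-C", PUPPET_REPO, *args],
            check=True, capture_output=True, text=True,
        ).stdout

    def what_was_deployed(n=10):
        """Show the last n changes that went into production."""
        return git("log", f"-{n}", "--oneline")

    def roll_back(commit):
        """Revert a bad change; the next Puppet run picks up the revert."""
        return git("revert", "--no-edit", commit)

    if __name__ == "__main__":
        print(what_was_deployed())

In practice you'd probably just run those two git commands by hand; the point is that keeping the manifests in git makes the change history and the rollback path explicit.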
We have used NetApp for years and decided to move to Pillar Data Systems, as they are much more forward-thinking, easier to work with, and understand storage systems at a very deep level. NetApp wants you to buy new equipment every few years and forces this by increasing support costs very quickly.
The 2-disk failure itself did not cause the outage; what did was the process the filer head had to go through to get the data back onto new drives, plus further actions taken with SnapMirror and other things to try to recover faster.