Hacker News new | past | comments | ask | show | jobs | submit | rpug's comments login

@saaaam, As a tech leader working in the publishing industry, I'd be interested in connecting with you to hear more about what you work on. My email is my username @lp0.org. Maybe there are opportunities for us to collaborate!


As someone who has been down this road many times before - I can't stress this enough: DDoS mitigation solutions don't solve the problem of an app-specific layer7 attack and it is important to do some testing of how well your mitigation service responds (and that it isn't a silver bullet.) Additionally, you need to make sure your team has tested and proven procedures for engaging the service, respond to attacks, etc. Services like NimbusDDoS (www.nimbusddos.com) are good because you can do some real scenario testing and make sure your team and infrastructure is prepared. There are other services out there too that I am less familiar with, but either way really good stuff to do.


The guys at CryoKey (https://www.cryokey.com/) are basically trying to do this. I haven't personally used it, though.


Does cardholder data ever pass through your infrastructure in any form?


In magento, yes... but the full information is held for a small amount time. The module is self hold no card info its a run once and then destroy.


Not that I am an auditor, but if the data ever hits your environment then you have a level of compliance to maintain.


Do you use any particular tools to track approvals to changes, policies etc?


We just use RT [1] and TWiki [2]. Changes come in as ticket in RT [1] and we discuses them at change management meetings (document these in the TWiki [2]). We document everthing in the TWiki. If someone comes and asks for our change management policy, we point them at the TWiki, which talks about puppet, and then we can show them the change management minutes, etc. We have a light process and it seems to work.

[1] http://bestpractical.com/rt/

[2] http://twiki.org/


A case management system and a wiki has typically been how I have done this. It can be a little tough though because these tools aren't necessarily built for this type of workflow. Perhaps RT does a better job than some of the other options which really want to be a support ticketing system or a bug tracking system rather than a change management system.


Yeah, we only have 4 people in ops and about 20 in development. We have daily standup where the ops guys and 1 person from dev meet (total 5 people). We discuss what is happening Past and Next 24 -- this takes about 10 minutes. The process is super light.

We also use puppet with git. This allows us to version everything that goes into production via a puppet tweak. This is great for rolling back changes or getting an of what was deployed. Like I said, read that visible ops handbook.


What about audits of user accounts and access control?


The developers want something that works.. and is fast. No waiting for builds!


Are the machines the fastest you can afford?


We are actually planning to buy some NetApp equipment and have heard nothing but good things.

I am very curious as to why a two disk failure caused an outage. What exactly happened when both disks failed?


We have used NetApp for years and decided to move to Pillar Data Systems as they are much more forward thinking, easier to work with and understand storage systems at a very deep level. NetApp wants you to buy new equipment every few years and force this by increasing support costs very quickly.

The 2 disk failure did not cause the outage, but the process the filer head had to go through to get the data back onto new drives and then further actions taken with SnapMirror and other items to try and recover faster.


The way you've worded this, and the official response, it reads like you didn't have hot spares available?

That being the case, the two disk failures didn't need to be concurrent for you to end up where you did...


RAID-DP and the array always has a hot spare, and we had one cold spare on-site and another there within hours to replace the 2nd failure.


dh, Could you send me an email? my username @ tripadvisor.com

I'd love to chat a bit about your experiences.. I'll even take you out to lunch (I'm right down the street in Newton). :P


You should have an email from me. Happy to chat anytime.


splitting the window into multiple terminals makes me happy. this is the same functionality I love in Terminator under Linux.


TripAdvisor is hiring Linux sysadmins for both corporate IT and livesite operations based in Newton, MA.

http://tinyurl.com/29bno2t http://tinyurl.com/2dhw434

Not really qualified as a 'startup', I guess.. but we keep the startup energy going :)

I can be contacted via email with any questions: ryan at tripadvisor dot com.


Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: