Events‎ > ‎

Tachyon: an open source memory-centric distributed storage system

posted Jun 11, 2015, 11:53 AM by Hao Xu   [ updated Jul 2, 2015, 3:07 PM by 郭晓峰 ]
Links:
本次活动注册表:http://techmeetup-20150719.eventbrite.com
本次活动详情链接:http://www.tech-meetup.com/events/20150719
加入我们的communityhttp://www.tech-meetup.com/wechat and http://www.tech-meetup.com/signup

Time: 1:30PM ~ 3:40PM, 07/19/2015, Sunday


Location: 1320 Ridder Park Dr, San Jose, CA 95131


Agenda:

1:30pm - 1:50pm: Reception and social time

1:50pm - 2:30pm: Session 1 by Bin Fan

2:30pm - 3:10pm: Session 2 by Shaoshan Liu

3:10pm - 3:30pm: Q&A and offline networking with Bin Fan, Shaoshan Liu and Haoyuan Li


Tech Talks Abstract


  • Session 1: Tachyon overview

    Abstract: Tachyon is a memory-centric fault-tolerant distributed storage system, which enables reliable file sharing at memory-speed. It was born in UC Berkeley AMPLab. It is open source and is deployed at multiple companies. In addition, Tachyon has more than 100 contributors from over 30 institutions, including Baidu, IBM, Intel, and Yahoo etc. Earlier this year, the latest spinout from AMPLab, Tachyon Nexus, started to commercialize Tachyon. The company is funded by Andreessen Horowitz. It was also recently listed on 9 Hot Enterprise Storage Companies to Watch by Network World and Computer World. In this talk, we present an overview of Tachyon, as well as some recent development and use cases.


  • Session 2: Fast big data analytics with Spark on Tachyon in Baidu

    Abstract: In this talk we will focus on how Tachyon can help improve big data analytics (ad-hoc query) efficiency (up to 30x performance improvement) within Baidu. In detail, we will explain: Currently within Baidu, we have a production Tachyon cluster with 150 nodes and over 2 PB of storage space, this cluster mainly serves as the cache layer for our  Big Data Analytics engine. In this talk, first we introduce the Big Data Analytic infrastructure within Baidu.  Then, we explain why we started using Tachyon several months ago, as well as the problems encountered when we started using Tachyon. Next, we delve into the details of how Tachyon help accelerate our Big Data Analytics pipeline at its current state. At the end, we discuss what new features we want to see and the plan to scale further.


Speakers’ bio:


  • Bin Fan is a software engineer at Tachyon Nexus. He is a top committer of the Tachyon project. Prior to Tachyon Nexus, he worked in Google to build the core storage infrastructure and won Google's Technical Infrastructure award. Bin got his Ph.D. in computer science from Carnegie Mellon University.

  • Shaoshan Liu is currently a Senior Architect at Baidu U.S.A. working on Big Data Infrastructure. Before Baidu, he worked at Linkedin and Microsoft. Shaoshan has a Ph.D. from UC Irvine.

  • Haoyuan Li is founder and CEO of Tachyon Nexus. He is a Computer Science Ph.D. candidate in AMPLab at UC Berkeley, where he co-created Tachyon, an open source memory-centric distributed storage system. He is also a founding committer of Apache Spark. Before Berkeley, he worked at Conviva and Google. Haoyuan has a M.S. from Cornell University and a B.S. from Peking University


主办
湾区同学技术沙龙(www.tech-meetup.com)

协办
南京大学硅谷校友会
硅谷清华联网
中国科技大学校友会创业俱乐部
浙江大学校友会海纳创新创业俱乐部
北京大学北加州校友会
武汉大学北加州校友会
东南大学硅谷校友会
吉林大学硅谷校友会
复旦大学北加州校友会
华人事业互助会

华美信息存储协会
JayW Salon
Comments