Events‎ > ‎

Tachyon: an open source memory-centric distributed storage system

posted Jun 11, 2015, 11:53 AM by Hao Xu   [ updated Jul 2, 2015, 3:07 PM by 郭晓峰 ]
加入我们的community and

Time: 1:30PM ~ 3:40PM, 07/19/2015, Sunday

Location: 1320 Ridder Park Dr, San Jose, CA 95131


1:30pm - 1:50pm: Reception and social time

1:50pm - 2:30pm: Session 1 by Bin Fan

2:30pm - 3:10pm: Session 2 by Shaoshan Liu

3:10pm - 3:30pm: Q&A and offline networking with Bin Fan, Shaoshan Liu and Haoyuan Li

Tech Talks Abstract

  • Session 1: Tachyon overview

    Abstract: Tachyon is a memory-centric fault-tolerant distributed storage system, which enables reliable file sharing at memory-speed. It was born in UC Berkeley AMPLab. It is open source and is deployed at multiple companies. In addition, Tachyon has more than 100 contributors from over 30 institutions, including Baidu, IBM, Intel, and Yahoo etc. Earlier this year, the latest spinout from AMPLab, Tachyon Nexus, started to commercialize Tachyon. The company is funded by Andreessen Horowitz. It was also recently listed on 9 Hot Enterprise Storage Companies to Watch by Network World and Computer World. In this talk, we present an overview of Tachyon, as well as some recent development and use cases.

  • Session 2: Fast big data analytics with Spark on Tachyon in Baidu

    Abstract: In this talk we will focus on how Tachyon can help improve big data analytics (ad-hoc query) efficiency (up to 30x performance improvement) within Baidu. In detail, we will explain: Currently within Baidu, we have a production Tachyon cluster with 150 nodes and over 2 PB of storage space, this cluster mainly serves as the cache layer for our  Big Data Analytics engine. In this talk, first we introduce the Big Data Analytic infrastructure within Baidu.  Then, we explain why we started using Tachyon several months ago, as well as the problems encountered when we started using Tachyon. Next, we delve into the details of how Tachyon help accelerate our Big Data Analytics pipeline at its current state. At the end, we discuss what new features we want to see and the plan to scale further.

Speakers’ bio:

  • Bin Fan is a software engineer at Tachyon Nexus. He is a top committer of the Tachyon project. Prior to Tachyon Nexus, he worked in Google to build the core storage infrastructure and won Google's Technical Infrastructure award. Bin got his Ph.D. in computer science from Carnegie Mellon University.

  • Shaoshan Liu is currently a Senior Architect at Baidu U.S.A. working on Big Data Infrastructure. Before Baidu, he worked at Linkedin and Microsoft. Shaoshan has a Ph.D. from UC Irvine.

  • Haoyuan Li is founder and CEO of Tachyon Nexus. He is a Computer Science Ph.D. candidate in AMPLab at UC Berkeley, where he co-created Tachyon, an open source memory-centric distributed storage system. He is also a founding committer of Apache Spark. Before Berkeley, he worked at Conviva and Google. Haoyuan has a M.S. from Cornell University and a B.S. from Peking University



JayW Salon