Events

Notes:
最近活动越高越大,也碰到一些奇奇怪怪的事情。我们只是纯粹的为同学们提供一个分享技术的平台,非常不希望让自己陷入无可预测的囧境。为了避免这种情况,我们找了律师朋友,帮我们准备了一份免责申明。希望参加活动的同学,在参加活动前能事先在免责申明(http://tinyurl.com/wtm-waiver)上签个字并带过来。对此给同学们带来的不便,敬请谅解。多谢!

我们同时在跟我们的朋友联系,如果可以每个人只签一次名的话,只要你参加过一次活动,以后的活动,同学们就不用再签名了。希望能找到一个方便的方法让大家尽可能不那么麻烦。

多谢大家的理解!

CoreOS rkt, a Container Runtime

posted Oct 27, 2015, 8:23 PM by 郭晓峰   [ updated Oct 28, 2015, 8:06 AM ]

Registration link: http://tech-meetup-11-08-2015.eventbrite.com

Event Link: http://www.tech-meetup.com/events/11-08-2015

Join our community:  Please check http://www.tech-meetup.com


Abstract:

After being used and developed by Google since a decade ago, container technology recently becomes a hot topic in data center innovation. With the help of companies like Google, Redhat, CoreOS, Docker etc. The container ecosystem is now flourishing, there are a bunch of tools there to help you package, ship, and run your application almost everywhere from AWS, GCE to your tiny Raspberry PI cluster.

Although Docker gives users great experience to build and run containers, the industry still believes there are some pieces missing here. That's why CoreOS created APPC(App Container Spec) and worked with Companies like Google, Redhat, CloudFoundry, Mesosphere to push it as a container standard to address problems like image format, runtime spec, networking and other important issues. rkt (pronounced as Rocket) is a container runtime implementation for APPC created by CoreOS. By having a different design than Docker, it also tries to solve problems like security and composability. In this talk, Yifan Gu, a maintainer of the CoreOS rkt project, will discuss containers, APPC, rkt in much more details. Hopefully you can get an idea of what's container, why you should begin to use it after the talk.


Speaker’s bio:

Yifan is a maintainer of the rkt project at CoreOS(https://github.com/coreos/rkt). He also contributes to the Kubernetes project and strives to make it a good experience for people to use rkt in Kubernetes. Yifan is interested in system programing and debugging. He graduated from the CMU’s VLIS master program.


Language: Chinese


Time: 1:30PM ~ 4:00PM, 11/08/2015, [Sunday]

Location: 97 E Brokaw Rd, Ste 210, San Jose, CA 95112


Agenda:

1:30pm - 2:00pm: Reception and social time

2:00pm - 3:30pm: Talk and QA

3:30pm - 4:00pm: offline networking


主办

湾区同学技术沙龙(www.tech-meetup.com


协办

南京大学硅谷校友会

瀚海硅谷科技园

硅谷清华联网

中国科技大学校友会创业俱乐部

浙江大学校友会海纳创新创业俱乐部

北京大学北加州校友会

武汉大学北加州校友会

东南大学硅谷校友会

吉林大学硅谷校友会

复旦大学北加州校友会

华人事业互助会

Spark MLlib: Past, Present, and Future

posted Sep 20, 2015, 5:58 PM by peng du

Tech workshop: “Spark MLlib: Past, Present, and Future”

Links:


Time: 1:30PM ~ 4:00PM, 10/03/2015, Saturday

Location: 97 E Brokaw Rd, Ste 210, San Jose, CA 95112

Agenda:

1:30pm - 2:00pm: Reception and social time

2:00pm - 3:30pm: Talk and QA

3:30pm - 4:00pm: offline networking

Tech Talks Abstract:

Apache Spark provides primitives for in-memory cluster computing, which is well suited for large-scale machine learning purposes. MLlib is a standard component in Spark providing machine learning primitives, initially created and contributed to Spark by UC Berkeley. With 50+ companies and 180+ individual developers contributing to MLlib, it is one of the most active open source projects for machine learning. MLlib’s goal is to make practical machine learning scalable and easy, and the community has devoted lot of time and effort towards this goal. In this talk, we present a brief history of MLlib, summarize new features in Spark 1.5, and discuss the roadmap. We will show the expansion of MLlib’s feature set, the evolution of MLlib’s pipeline API, the elevation of MLlib’s performance, as well as the integration with other Spark components. We will also provide entry points for users and developers to get started with Spark MLlib.


Speaker’ bio:

  • Xiangrui Meng is an Apache Spark PMC member and a software engineer at Databricks. His main interests center around developing and implementing scalable algorithms for scientific applications. He has been actively involved in the development and maintenance of Spark MLlib since he joined Databricks. Before Databricks, he worked as an applied research engineer at LinkedIn, where he was the main developer of an offline machine learning framework in Hadoop MapReduce. His Ph.D. work at Stanford is on randomized algorithms for large-scale linear regression problems.


主办

湾区同学技术沙龙(www.tech-meetup.com


协办

南京大学硅谷校友会

瀚海硅谷科技园

硅谷清华联网

中国科技大学校友会创业俱乐部

浙江大学校友会海纳创新创业俱乐部

北京大学北加州校友会

武汉大学北加州校友会

东南大学硅谷校友会

吉林大学硅谷校友会

复旦大学北加州校友会

华人事业互助会

中美创新协会(CHAIN)


Cassandra: an open source distributed database

posted Aug 2, 2015, 11:02 PM by peng du   [ updated Sep 1, 2015, 1:48 PM by Lei Xia ]

Links:

Time: 1:30PM ~ 4:00PM, 08/29/2015, Saturday


Location: 97 E Brokaw Rd, Ste 210, San Jose, CA 95112


Agenda:

1:30pm - 2:00pm: Reception and social time

2:00pm - 3:30pm: Talk and QA

3:30pm - 4:00pm: offline networking


Tech Talks Abstract:

Apache Cassandra is perfect for managing large amounts of structured, semi-structured, and unstructured data across multiple data centers and the cloud. Cassandra delivers continuous availability, linear scalability, and operational simplicity across many commodity servers with no single point of failure, along with a powerful dynamic data model designed for maximum flexibility and fast response times. Cassandra powers many mission-critical applications at Netflix, Coursera, Intuit, UBS, Nvidia, Safeway, Orbeus, etc.  In this talk, we will focus on the core architecture of Cassandra, from read path, write path, internode communication, replication strategy to query language, data modeling, etc. We will also cover the architecture of DataStax Enterprise database platform build on top of Cassandra, tools to manage the Cassandra and the drivers to develop with Cassandra.


Speaker’ bio:

  • Charles Cao is a Director of Engineering at DataStax. He builds the enterprise-grade Cassandra and the Big Search on top of Cassandra and Solr.

  • Linkedin: https://www.linkedin.com/in/puertea


Other Information:

Cassandra Summit: 9/22-24/2015 - http://cassandrasummit-datastax.com


主办

湾区同学技术沙龙(www.tech-meetup.com


协办

南京大学硅谷校友会

瀚海硅谷科技园

硅谷清华联网

中国科技大学校友会创业俱乐部

浙江大学校友会海纳创新创业俱乐部

北京大学北加州校友会

武汉大学北加州校友会

东南大学硅谷校友会

吉林大学硅谷校友会

复旦大学北加州校友会

华人事业互助会

中美创新协会(CHAIN)


Video-Part 1




Video-Part 2



Tachyon: an open source memory-centric distributed storage system

posted Jun 11, 2015, 11:53 AM by Hao Xu   [ updated Jul 2, 2015, 3:07 PM by 郭晓峰 ]

Links:
本次活动注册表:http://techmeetup-20150719.eventbrite.com
本次活动详情链接:http://www.tech-meetup.com/events/20150719
加入我们的communityhttp://www.tech-meetup.com/wechat and http://www.tech-meetup.com/signup

Time: 1:30PM ~ 3:40PM, 07/19/2015, Sunday


Location: 1320 Ridder Park Dr, San Jose, CA 95131


Agenda:

1:30pm - 1:50pm: Reception and social time

1:50pm - 2:30pm: Session 1 by Bin Fan

2:30pm - 3:10pm: Session 2 by Shaoshan Liu

3:10pm - 3:30pm: Q&A and offline networking with Bin Fan, Shaoshan Liu and Haoyuan Li


Tech Talks Abstract


  • Session 1: Tachyon overview

    Abstract: Tachyon is a memory-centric fault-tolerant distributed storage system, which enables reliable file sharing at memory-speed. It was born in UC Berkeley AMPLab. It is open source and is deployed at multiple companies. In addition, Tachyon has more than 100 contributors from over 30 institutions, including Baidu, IBM, Intel, and Yahoo etc. Earlier this year, the latest spinout from AMPLab, Tachyon Nexus, started to commercialize Tachyon. The company is funded by Andreessen Horowitz. It was also recently listed on 9 Hot Enterprise Storage Companies to Watch by Network World and Computer World. In this talk, we present an overview of Tachyon, as well as some recent development and use cases.


  • Session 2: Fast big data analytics with Spark on Tachyon in Baidu

    Abstract: In this talk we will focus on how Tachyon can help improve big data analytics (ad-hoc query) efficiency (up to 30x performance improvement) within Baidu. In detail, we will explain: Currently within Baidu, we have a production Tachyon cluster with 150 nodes and over 2 PB of storage space, this cluster mainly serves as the cache layer for our  Big Data Analytics engine. In this talk, first we introduce the Big Data Analytic infrastructure within Baidu.  Then, we explain why we started using Tachyon several months ago, as well as the problems encountered when we started using Tachyon. Next, we delve into the details of how Tachyon help accelerate our Big Data Analytics pipeline at its current state. At the end, we discuss what new features we want to see and the plan to scale further.


Speakers’ bio:


  • Bin Fan is a software engineer at Tachyon Nexus. He is a top committer of the Tachyon project. Prior to Tachyon Nexus, he worked in Google to build the core storage infrastructure and won Google's Technical Infrastructure award. Bin got his Ph.D. in computer science from Carnegie Mellon University.

  • Shaoshan Liu is currently a Senior Architect at Baidu U.S.A. working on Big Data Infrastructure. Before Baidu, he worked at Linkedin and Microsoft. Shaoshan has a Ph.D. from UC Irvine.

  • Haoyuan Li is founder and CEO of Tachyon Nexus. He is a Computer Science Ph.D. candidate in AMPLab at UC Berkeley, where he co-created Tachyon, an open source memory-centric distributed storage system. He is also a founding committer of Apache Spark. Before Berkeley, he worked at Conviva and Google. Haoyuan has a M.S. from Cornell University and a B.S. from Peking University


主办
湾区同学技术沙龙(www.tech-meetup.com)

协办
南京大学硅谷校友会
硅谷清华联网
中国科技大学校友会创业俱乐部
浙江大学校友会海纳创新创业俱乐部
北京大学北加州校友会
武汉大学北加州校友会
东南大学硅谷校友会
吉林大学硅谷校友会
复旦大学北加州校友会
华人事业互助会

华美信息存储协会
JayW Salon

Apache Samza: a distributed stream processing framework.

posted Jun 11, 2015, 11:53 AM by Lei Xia   [ updated Jun 28, 2015, 8:26 PM by 郭晓峰 ]

Links:
本次活动注册表:http://tech-meetup-6-21-2015.eventbrite.com
本次活动详情链接:http://www.tech-meetup.com/events/20150621
加入我们的community:http://www.tech-meetup.com/wechat and http://www.tech-meetup.com/signup

时间:  1:30PM ~ 3:30PM, 06/21/2015, Sunday

地点: 1320 Ridder Park Dr, San Jose, CA 95131

Tech Talk简介:
Apache Samza: a distributed stream processing framework.

Abstract:
The world is going real-time. MapReduce, SQL-on-Hadoop and similar batch processing tools are fine for analyzing and processing data after the fact — but sometimes you need to process data continuously as it comes in, and react to it within a few seconds or less. How do you do that at Hadoop scale?

Apache Samza is an open source stream processing framework designed for continuous data processing. Unlike batch processing systems such as Hadoop which typically has high-latency responses (sometimes hours), Samza continuously computes results as data arrives which makes sub-second response times possible. Samza has some unique features that make it powerful. It provides high performance for stateful processing jobs, including aggregation and joins between many input streams. It is designed to support an ecosystem of many different jobs written by different teams, and it isolates them from each other, so that one badly behaved job can’t affect the others.

At LinkedIn, we have been using Samza in production both for internal analytic purposes and for data products that are served on the live site. In this talk, we will focus on detailed architecture of Samza, and comparison with other major open-sourced streaming process frameworks.


报告人:
Yi Pan is a Staff Engineer and one of the Technical Leads in Data infrastructure team at LinkedIn. He has been a major contributor to Samza project at LinkedIn.

活动安排:

1:30pm - 1:50pm receiption and social time
1:50pm - 3:00pm talk and Q&A
3:00pm - 3:30pm: offline networking

主办
湾区同学技术沙龙(www.tech-meetup.com)

协办
南京大学硅谷校友会
硅谷清华联网
中国科技大学校友会创业俱乐部
浙江大学校友会海纳创新创业俱乐部
北京大学北加州校友会
武汉大学北加州校友会
东南大学硅谷校友会
吉林大学硅谷校友会
复旦大学北加州校友会
华人事业互助会
华美信息存储协会

Video

Samza Talk (Jun 21, 2015)

Slides

samza_tech_talk_2015 - tech meetup.pptx



大数据时代的金融服务创新

posted May 18, 2015, 10:31 PM by 郭晓峰   [ updated May 19, 2015, 12:07 AM ]

Links:
本次活动注册表:http://tiny.cc/signup-20150531

时间
1:30PM ~ 3:30PM, 05/31/2015, Sunday
地点
Roof top conference room, Plug and Play Tech Center, 440 N Wolfe Rd, Sunnyvale, CA 94085

报告人简介
程立,阿里集团合伙人之一,蚂蚁金服集团(原支付宝)CTO。程立先生2005年加入Alibaba集团,他撰写了alipay.com最初的代码,并担任该系统首席架构师。他与同事一起创建了高可扩展、高可靠的蚂蚁金服平台,并支撑了大量创新性的金融服务。

程立先生于2000年在上海大学获得本科学位,并于2000至2004年在上海交大攻读博士学位。

报告简介
New technologies are changing every aspects of the business world. In this talk, Cheng Li will illustrate how Alibaba and Ant Financial innovate on cloud computing and big data technologies, and use the new technologies to change the financial services industry of China. He’ll tell the story of building a new distributed and fault tolerant database system called OceanBase, a financial-grade cloud computing platform, and an open full-stack financial big data platform. Based on these technology achievements, innovative financial services, such as the Alipay, Yu’e Bao, Ant micro loan, Internet Bank, can be quickly built and scaled to hundreds of millions users. Furthermore, he’ll share his view of the technology opportunities and challenges driven by the quick growth and innovation of internet finance

同行者
李静明,蚂蚁金服集团副总裁,美国总经理
谷雪梅,搜索事业部副总裁
王曦若,共享业务事业部副总裁
丁宏伟,菜鸟集团数据运营部研究员
蒋江伟,共享业务事业部资深技术专家
李强,天猫事业部总监
郭东白,AliExpress技术部总监
庄卓然,淘宝无线事业部总监

活动安排
1:30PM ~ 2:00PM, signup & social
2:00PM ~ 3:30PM, talk and Q&A session
3:30PM ~ 4:00PM, social with directors from Alibaba
这次来的人即使在杭州都不容易一次性见到,难得来硅谷,大家可以随意的做更多一点的技术交流。他们会在硅谷停留近一周的时间,如果某个方面有比较多的人感兴趣,我们还可以在后面的几天里安排时间让大家小范围的对具体的技术方向进行更深入的讨论。

主办
Alibaba Technology Forum (Silicon Valley)
湾区同学技术沙龙(www.tech-meetup.com)

协办
南京大学硅谷校友会
硅谷清华联网
中国科技大学校友会创业俱乐部
浙江大学校友会海纳创新创业俱乐部
北京大学北加州校友会
武汉大学北加州校友会
东南大学硅谷校友会
吉林大学硅谷校友会
复旦大学北加州校友会
华人事业互助会
中美创新协会(CHAIN)
斯坦福中国学生学者联谊会(ACSSS)

May 9, 2015 - 大数据人工智能

posted Apr 17, 2015, 9:37 AM by 郭晓峰   [ updated May 10, 2015, 7:14 AM ]

Links:
- 本次活动注册表:http://tiny.cc/signup-20150509
- 本次活动详情链接:http://www.tech-meetup.com/events/20150509
- 本次活动问题收集:http://tiny.cc/faq-20150509
时间:1:30PM ~ 3:30PM, 05/09/2015, Saturday
地点:3600 Juliette Lane, Santa Clara, CA (Intel SC12)

报告人简介:
余凯博士,百度研究院副院长,百度深度学习实验室(IDL)主任, 兼任负责百度图片搜索部的高级总监,中组部“千人计划”国家特聘专家。在百度所领导的团队在广告、搜索、语音、图像等领域做出突出贡献,三次获得“百度最高奖”。2014年以来,他领导了百度大脑,百度自动驾驶,BaiduEye, 以及DuBike等一系列创新项目。他是国际著名机器学习专家,发表论文被引用达8000次,获机器学习顶尖会议ICML-2013的最佳论文奖银奖,曾任ICML和NIPS领域主席。2011年应邀在斯坦福大学计算机系主讲研究生课程“CS121:Introduction to Aritificial Intelligence”。曾在ImageNet等评测中屡获国际第一。他毕业于南京大学,于慕尼黑大学获得计算机博士学位,曾在微软,西门子,和NEC工作。他还是南京大学和北邮兼职教授,中科院计算所客座研究员,并被授予中关村高端领军人才和北京市海外高层次人才。

活动安排:
1:30PM ~ 3:30PM Talk and Q&A

主办:
湾区同学技术沙龙(www.tech-meetup.com)

协办:
Chinese American Information Storage Society(华美信息存储协会)
南京大学硅谷校友会
硅谷清华联网
中国科技大学校友会创业俱乐部
浙江大学校友会海纳创新创业俱乐部
北京大学北加州校友会
武汉大学北加州校友会
东南大学硅谷校友会
吉林大学硅谷校友会
复旦大学北加州校友会
华人事业互助会

活动一览

Mar. 29, 2015 - Photon: Fault-tolerant and scalable joining of continuous data streams

posted Mar 14, 2015, 4:03 PM by Lei Xia   [ updated Mar 20, 2015, 4:05 PM by Ping Zhu ]

Links:

- 本次活动注册表:http://tinyurl.com/signup-20150329 
- 本次活动详情链接:http://www.tech-meetup.com/events/20150329 
- 本次活动问题收集:http://tiny.cc/faq-20150329
- 如果你想加入我们的mailing list,请移步http://www.tech-meetup.com/signup 

时间:1:30pm - 4pm, 03/29/2015, Sunday

地点:1601 McCarthy Boulevard, Milpitas, CA 95035 (TIPark Silicon Valley)

Tech Talk简介:

Photon: Fault-tolerant and scalable joining of continuous data streams

Abstract:
Photon is a highly fault-tolerant, scalable, low-latency and stateful distributed system to join multiple streams of data flowing continuously. Joining these data streams is critical to extract key metrics about Google ads system used for billing and internal analysis. Photon accomplishes exactly-once semantics and can automatically withstand datacenter-level outages, providing an order of magnitude higher uptime SLA relative to a single datacenter system. Our production deployment processes over one million events per second at peak with end-to-end latency of less than 30 seconds. In this talk, we will focus on detailed architecture of Photon, including a highly scalable paxos-based storage system.

Link to Photon publication: http://research.google.com/pubs/pub41318.html


报告人:
Tianhao is one of the Technical Leads in Ads Data infrastructure team at Google. He has been a major contributor to the design, implementation and launch of Photon.

活动安排:

1:30pm - 1:50pm receiption and social time
1:50pm - 2:10pm recruiting time: 20 minutes
2:10pm - 3:30pm talk and Q&A
3:30pm - 4pm: offline networking



主办: 湾区同学技术沙龙 (www.tech-meetup.com) 协办: TIPark Silicon Valley(感谢TIPark赞助场地) 南京大学硅谷校友会 硅谷清华联网 中国科技大学校友会创业俱乐部 浙江大学校友会海纳创新创业俱乐部 北京大学北加州校友会

武汉大学北加州校友会 东南大学硅谷校友会


Mar 1, 2015 - Large-scale data science and engineering with Spark

posted Feb 10, 2015, 11:15 PM by Mao Ye   [ updated Mar 3, 2015, 11:54 PM ]


Links:

- 本次活动注册表:http://tiny.cc/signup-20150301 - 本次活动详情链接:http://www.tech-meetup.com/events/20150301 - 本次活动问题收集:http://tiny.cc/faq-20150301 - 如果你想加入我们的mailing list,请移步http://www.tech-meetup.com/signup

时间:1:30pm - 4pm, 03/01/2015, Sunday


地点:1601 McCarthy Boulevard, Milpitas, CA 95035 (TIPark Silicon Valley)


Tech Talk简介:

Apache Spark has taken Big Data by storm, subsuming Hadoop MapReduce. In this talk, Reynold Xin from Databricks will give a quick introduction to Spark, with a focus on the latest development activities aimed at making large-scale data science and engineering more approachable. In particular, the following will be discussed:


- Spark's basic programming API

- the new DataFrame API for big data

- machine learning pipeline integration

- Databricks Cloud


报告人:

Reynold Xin is a committer and PMC member on Apache Spark. He is also a co-founder of Databricks. He has been instrumental in the development of Spark as the maintainer of many components. He recently led an effort to scale up Spark and set a new world record in 100 TB sorting (Daytona Gray). Before Databricks, he was pursuing a PhD at UC Berkeley AMPLab. He wrote the two highest cited papers in SIGMOD 2011 and SIGMOD 2013.


活动安排:

1:30pm - 1:50pm receiption and social time

1:50pm - 2:10pm recruiting time: 20 minutes

2:10pm - 3:30pm talk and Q&A

3:30pm - 4pm: offline networking


Slides:

Google Docs Video


活动一览:

Meetup-2015-03-01


招聘信息:

 

Google Docs Video

 

Google Docs Video

 

Google Docs Video

 

Google Docs Video




主办: 湾区同学技术沙龙 (www.tech-meetup.com) 协办: TIPark Silicon Valley(感谢TIPark赞助场地) 南京大学硅谷校友会 硅谷清华联网 中国科技大学校友会创业俱乐部 浙江大学校友会海纳创新创业俱乐部 北京大学北加州校友会

武汉大学北加州校友会 东南大学硅谷校友会


Jan 25, 2015 - Building a real time data platform with Apache Kafka

posted Jan 11, 2015, 10:39 PM by 郭晓峰   [ updated Feb 10, 2015, 11:24 PM by Mao Ye ]

时间:
1:30pm - 4pm, 01/25/2015, Sunday

地点:
TIPark Silicon Valley
1601 McCarthy Boulevard, Milpitas, CA 95035

Tech Talk简介:
Apache Kafka is a high throughput, distributed messaging system. Since it's open sourced, various companies such as Twitter, Netflix, Uber, Airbnb, Pinterest have adopted Kafka in their big data eco-system. In this talk, Jun will first explain how typical companies are using Kafka and the importance of Kafka in the whole big data eco-system. Next, Jun will describe some of the underlying technologies in Kafka that helped make it popular. Finally, Jun will introduce what we are doing at Confluent to build a Kafka-based real time data platform.

报告人:
jun_rao.jpg
Jun Rao
Cofounder of Confluent, project chair of Apache Kafka
www.linkedin.com/in/junrao

问题收集:
活动安排:
1:30pm - 1:50pm receiption and social time
1:50pm - 2:10pm recruiting time: 20 minutes
2:10pm - 3:30pm talk and Q&A
3:30pm - 4pm: offline networking

主办: 
湾区同学技术沙龙 (www.tech-meetup.com)

协办:
TIPark Silicon Valley(感谢TIPark赞助场地)
南京大学硅谷校友会
硅谷清华联网
中国科技大学校友会创业俱乐部
浙江大学校友会海纳创新创业俱乐部
复旦大学硅谷校友会
北京大学北加州校友会
东南大学硅谷校友会

注册表

slides

junrao-01-25-15.pptx


Video


活动一览

招聘企业信息

 

linkedin_20150125.pptx

 

Palantir_20150125.pptx

 

tango_20150125.pptx

 

yahoo_20150125.pptx


1-10 of 28