
INCOSE-LA Speaker Meeting: Big Data

  • Date:
    Jul 7, 2016 - 05:30 PM - 08:00 PM
  • Venue:
    Caltech, Annenberg Center for Information Science and Technology, 330 S Chester Ave, Pasadena, CA.

INCOSE-LA Speaker Meeting 

TOPIC: Big-Data Technology Innovation: Hadoop, Real-time, and Machine Learning 

PRESENTER: Andy Feng, VP Architecture, Yahoo! 

HOST VENUE: Caltech, Annenberg Center for Information Science and Technology, 330 S Chester Ave, Pasadena, CA 

WHEN:  Thu, July 7, 2016
5:30 – 6:20pm   Registration, Networking, Refreshments 
6:20 – 6:30pm   Welcome and Announcements 
6:30 – 8:00pm   Meeting and Q&A 

Host site: Caltech, Annenberg Center for Information Science and Technology, 330 S Chester Ave, Pasadena, CA.

Remote Sites (potential): AV College in Lancaster; Boeing in Huntington Beach; Capstone Turbine in Chatsworth; JPL in La Canada; Control Point Corp in Goleta; NGC (employees only) in Azusa.

Webcast virtual site: Register as an individual virtual participant. (You must RSVP by noon on Tuesday, July 5, 2016.)

 See below for directions to the host site, and how to RSVP for a remote webcast site.  

COST: Members: FREE.  Non-members: $10.

Please RSVP by Tues, July 5, 2016, to facilitate event registration and planning.
Seats are limited: first registered, first served.

 You will be asked to provide your email address, name, title, company, phone number, and membership and citizenship information, and to identify the site at which you will attend (host site, remote site, or virtual/standalone).

 If you are uncertain whether or not you'll be able to attend, DO make a reservation and indicate that you're "probable". This helps with the planning. 

ABSTRACT: Yahoo started developing big-data technology with Hadoop MapReduce and File System in 2006, and made it an Apache open source project in 2009. Since then, big data has become a major component of the global tech industry, and Yahoo is leading the way. In the past three years, Yahoo has been a leading contributor to Apache Storm for event processing, Apache HBase for distributed NoSQL stores, Apache Spark for faster processing, and Druid for sub-second analytics. We have created new open source projects such as Apache Omid for transactional support of NoSQL stores, Yahoo Data Sketches for approximate analytics, and Yahoo CaffeOnSpark for distributed deep learning.  

 In this talk, we walk through Yahoo use cases (search, advertising, personalization, and Flickr) where our big-data technologies are best exemplified. We explain how Yahoo leverages these technologies to perform real-time processing and advanced machine learning against 600 petabytes of data, and describe the system architecture of our heterogeneous clusters of 40,000 servers for supporting a variety of workloads. We provide an overview of open source technologies (Apache Storm, Apache HBase, Apache Omid, and Yahoo CaffeOnSpark) and our in-house technology for large-scale machine learning.  We discuss how academic researchers and industry technologists can help advance big-data technologies further. 

BIOGRAPHY: Dr. Andy Feng is a VP of Architecture at Yahoo leading the architecture and design of big data and machine learning initiatives. He’s architected major platforms for personalization, ad serving, NoSQL, and cloud infrastructure. Prior to Yahoo, he was a Chief Architect at Netscape/AOL, and Principal Scientist at Xerox. 

Virtual Attendance for Individual Participants: We are providing virtual meeting capability for this event so that those who might otherwise be unable to participate may do so. To receive the instructions for virtual participation, please register and select the option of “Virtual” as the site. Instructions on how to connect will be provided. 

Virtual registration is open until the meeting is concluded. If you want virtual login information emailed to you, register by Tues, July 5.


First column: + operational, ? unknown, x not operational [a request has been issued, contact the site if attendance is contingent upon a site being available]  

If the remote site is not confirmed, please feel free to contact the site POC identified.




Antelope Valley/Palmdale
  • Coordinator contact information: Mike Wallace
  • Attendance: Open to all

Azusa: Northrop Grumman Corporation
  • Coordinator contact information: Maria Alvarez
  • Attendance: NG employees and Azusa NG Contractors only

Goleta: Control Point Corp, 110 Castilian, Suite 200, Goleta CA
  • Coordinator contact information: Paul Stowell
  • Attendance: Open to all

Pasadena: JPL, 4800 Oak Grove Dr., Pasadena
  • Coordinator contact information: Michela Munoz-Fernandez
  • Reservation Deadline: Register minimum one (1) week prior to meeting

Huntington Beach: Boeing, 14900 Bolsa Chica Rd, Bldg. 17, 1st floor, C/R 109
  • Coordinator contact information: Bob Noel, Beth O’Donnell, elizabeth.l.o'
  • Attendance: US Citizens, Resident Aliens only. Non-Boeing visitors must arrive prior to 6:00pm to get visitor’s badge
  • Reservation Deadline: 3 days

El Segundo: Aerospace D8/1010, 200 N Aviation Blvd
  • Coordinator contact information: David Jablonski
  • Attendance: Open to all, but with special requirements for foreign nationals
  • Reservation Deadline: One week before the meeting; foreign nationals, 10 days

Host Site:

Pasadena: Caltech, Annenberg Center for Information Science and Technology, 330 S Chester Ave, Pasadena
  • Coordinator contact information: Rick Hefner
  • Attendance: Open to all





The Annenberg Center for Information Science and Technology is located on the Caltech campus in Pasadena.  A map is located at  Visitors may park in any unreserved spot in Lot 9, Lot 10, or the Holliston parking structure; no permits are needed after 5:00pm.

INCOSE is the International Council on Systems Engineering, a not-for-profit membership organization founded in 1990 to develop and disseminate the interdisciplinary principles and practices that enable the realization of successful systems. Website: 

INCOSE-LA website: