Events

Video recording and production done by Strata + Hadoop

Strata 2015 Schedule

September 29 - October 1, 2015

(14 available presentations)
Rating: Everyone
Viewed 3 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

As Hadoop and the surrounding projects & vendors mature, their impact on the data management sector is growing. Amr will talk about his views on how that impact will change over the next five years. How central will Hadoop be to the data center of 2020? What industries will benefit most? Which technologies are at risk of displacement or encroachment?

Watch more from Strata + Hadoop San Jose 2015: http://goo.gl/k9J3GB
Visit the conference website to learn more: http://strataconf.com/big-data-confer...
Subscribe to O’Reilly on YouTube! http://goo.gl/szEauh

Stay Connected to O'Reilly Media by Email - http://goo.gl/YZSWbO
Follow O'Reilly Media:
http://plus.google.com/+oreillymedia
https://www.facebook.com/OReilly
https://twitter.com/OReillyMedia

About Amr Awadallah (Cloudera, Inc.):
Amr is Co-Founder and CTO of Cloudera. Prior to Cloudera, Amr was an Entrepreneur in Residence at Accel Partners. Before that he served as Vice President of Engineering at Yahoo!, and led a team that used Apache Hadoop extensively for data analysis and business intelligence across the Yahoo! online services. Amr joined Yahoo! after they acquired his first startup, VivaSmart, in mid-2000. Amr holds Bachelor’s and Master’s degrees in Electrical Engineering from Cairo University, Egypt, and a Doctorate in Electrical Engineering from Stanford University.

Rating: Everyone
Viewed 7 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

Prevailing wisdom has long held that transactional and analytical data processing must occur in separate data stores.

However, hidden costs of architectural complexity, combined with technical challenges facing modern enterprises, call for versatile, multi-purpose solutions rather than separate specialized solutions.

As infrastructure advances to meet user expectations, modern database design erodes the barriers between OLTP and OLAP, giving way to a third kind of database that converges transaction and analytic data processing.

Businesses need to move from offline batch processing to real-time data pipelines so they can provide personalization, detect anomalies, analyze data continuously, and manage business operations as they happen.

Advances with in-memory storage and distributed architectures enable businesses to process more data faster than ever before, with projects like Spark and Hadoop spawning ecosystems of complementary technologies.

Yet, enterprises require a hub for unifying big data processing technologies that allows them to work in concert.

In this session, MemSQL CEO Eric Frenkiel will discuss the need for simplicity in enterprise data architecture, the convergence of transactions and analytics, and what is required to operationalize Spark and Hadoop in the enterprise.

About Eric Frenkiel (MemSQL):
Eric Frenkiel co-founded MemSQL and has served as CEO since inception. Before MemSQL, Eric worked at Facebook on partnership development. He has worked in various engineering and sales engineering capacities at both consumer and enterprise startups. Eric is a graduate of Stanford University’s School of Engineering. In 2011 and 2012, Eric was named to Forbes’ 30 under 30 list of technology innovators.

Rating: Everyone
Viewed 6 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

From the 2015 Strata + Hadoop World in San Jose, Michael Greene's keynote: "Intel and the Role of Open Source in Delivering on the Promise of Big Data".

The exponential growth of digitally stored data and the transition of data science from academia to real world applications hold the promise of improving nearly every aspect of our lives. Michael Greene’s talk will dive beneath the hype of “big data” and “analytics” to reveal what Intel is building with the open source community to ensure that all developers have the tools needed to change the world with Apache Hadoop and Apache Spark. Intel’s vision is a horizontal, reusable and extensible architectural framework around these two big data cornerstones that supports many big data domains, facilitates simple integration of data science innovation and allows developers to focus on building the products and services that deliver the benefits and profits of big data analytics.

This keynote is sponsored by Intel.

About Michael Greene (Intel):
Michael Greene is vice president of the Software and Services Group and general manager of System Technologies and Optimization at Intel Corporation. Greene is responsible for delivering software and solutions that enable Intel and its customers to accelerate their software development activities, speed up the adoption of new Intel-based platforms and capabilities, and optimize enterprise and client applications. His team is charged with providing efficient platform software solutions to original equipment manufacturers (OEMs), BIOS developers, software partners and IT end users.

Since joining Intel in 1990 as an assembly test engineer, Greene has served in a variety of technical marketing, manufacturing, architecture and design roles. Most recently, he was director of platform technology and analysis, a role in which his responsibilities included evaluating future roadmaps for mobile, business, enterprise, graphics and software products.

Outside of Intel, Greene serves on the executive committee of the National GEM Consortium, a partnership that promotes the participation of underrepresented groups in post-graduate science and engineering education and the technical workforce.

Greene holds a patent in power-aware instruction scheduling and a bachelor’s degree in electrical engineering from the Massachusetts Institute of Technology.

Rating: Everyone
Viewed 6 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

Wearables contribute to Big Data, and the resulting insights are already producing significant gains in key industries:

Health – Provides a gateway for all kinds of non-invasive and ingestible monitoring: pulse, blood pressure, skin impedance, oxygen concentration, alcohol level. The results have seen a 56% increase in patient satisfaction and 3000 hours of care time conserved.

Fashion-Retail – Signaling between outfits, say, letting everyone wearing the same cause-related shirt know when they are close by, or, warning everyone wearing the same cocktail dress at an event in case they want to stay away from each other.

Sensory enhancements – Bone-induction hearing, nail polish that detects drugged beverages. Early metrics cite 17% reduction in crime.

Each of these new middlemen has three things in common: (1) they are fully exploiting a confluence of data science, mobile and social infrastructure; (2) their efforts are less than two years old; and (3) the results are nothing short of transformational.

The examples I offer, the fifth largest city in the U.S., one of the largest medical research centers in the world and a local SF clothier with now global reach, are upending their industries. We have hit a techno-behavioral tipping point.

About Lisa Hammitt (Salesforce):
Lisa Hammitt is a senior software executive with 25 years of industry experience. Most recently, as vice president of marketing of Salesforce Community Cloud, she is spearheading strategy and charting out industry-led use cases that drive community adoption and reach. Before Salesforce, Hammitt headed mergers and acquisitions in information management and cloud computing at IBM and HP. She received B.A. degrees in economics and French from the University of California, Berkeley, and completed graduate coursework in artificial intelligence at Stanford University.

Rating: Everyone
Viewed 6 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

To get value out of today’s big and fast data, organizations must evolve beyond traditional analytic cycles that are heavy with data transformation and schema management. The Hadoop revolution is about merging business analytics and production operations to create the ‘as-it-happens’ business. It’s not a matter of running a few queries to gain insight for the next business decision, but of changing the organization’s fundamental metabolic rate. It is essential to take a data-centric approach to infrastructure to provide flexible, real-time data access, collapsing data silos and automating data-to-action for immediate operational benefits.

This session is sponsored by MapR Technologies.

About Anil Gadre (MapR):
Anil Gadre is the SVP of Product Management at MapR. Prior to MapR, Anil was the EVP of Product Management at Silver Spring Networks, responsible for product strategy, planning and marketing of networking and software products focused on the Smart Grid for the energy industry. Before that, Anil was with Sun Microsystems, a Fortune 200 technology leader, where he served as EVP of the Application Platform Software organization and had previously been the Chief Marketing Officer, leading global branding, demand creation and an extensive developer ecosystem program. At Sun Microsystems, his experience covered diverse product lines ranging from networked desktop and enterprise server systems to market-leading software products such as the Solaris operating system, Java, the MySQL database and various middleware products. He has a BSEE from Stanford University and an MM degree from the Kellogg School at Northwestern University.

Rating: Everyone
Viewed 7 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

From the 2015 Strata + Hadoop World in San Jose, Poppy Crum's keynote address, "Data-driven Sensory Intelligence – Optimizing Our Perceptual Capabilities".

Our experience of the sensory world does not need to be constrained by our physical limitations. When navigating the environment our senses interact to perceive a robust non-veridical experience. Understanding these interactions and being able to define them perceptually and algorithmically allows technological developments that can facilitate sensory enhancement and optimization. Examples include use of targeted neuroplasticity to modify our internal experiences, personalized technologies for sensory augmentation, and maximizing the efficiency of our sensory capacity.

About Poppy Crum (Dolby Laboratories | Stanford University):
Poppy Crum leads the Science Group at Dolby Laboratories and is a Consulting Professor at Stanford University in the Center for Computer Research in Music and Acoustics and the Program in Symbolic Systems. At Dolby, Poppy is directing the growth of internal science. She is responsible for integrating neuroscience and psychophysical knowledge into algorithm and technological development and technology strategy. At Stanford, Poppy’s work and course direction is focused on the impact and feedback potential of new technologies with gaming and immersive environments on neuroplasticity.
Poppy also represents scientific interests among external standards bodies and scientific organizations. She is currently a fellow of the US Defense Science Research Council and a U.S. representative to the International Telecommunication Union (ITU). Prior to joining Dolby Laboratories, Poppy was Research Faculty in the Department of Biomedical Engineering at Johns Hopkins School of Medicine, where her research focused on the functional circuitry of the auditory cortex. Poppy is a Fellow of the Audio Engineering Society. She completed her postdoctoral work in Biomedical Engineering at Johns Hopkins, her PhD in Neuroscience/Psychology at UC Berkeley, her M.A. in Experimental Psychology at McGill University, and her B.Mus. in Violin Performance at the University of Iowa.

Rating: Everyone
Viewed 5 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

Open data is quickly gaining momentum, and when applied as data for good, it becomes a much more powerful concept that we should all consider as good data stewards. Organizations and cities are starting to share data such as traffic conditions or climate sensor readings, allowing others to use this open data to improve quality of life. But could this same open data be used for more nefarious purposes? Very likely, so in this session we will explore the balance between sharing data and locking it down, and why security should be on by default to protect all data, including open data.

About Eddie Garcia (Cloudera):
Eddie Garcia is chief security architect at Cloudera, a leader in enterprise analytic data management. He brings more than 20 years of information and data security experience to the company. Garcia is responsible for helping Cloudera enterprise customers reduce security and compliance risks associated with sensitive data sets stored and accessed in Apache Hadoop environments.

Formerly the VP of InfoSec and Engineering for Gazzang prior to its acquisition by Cloudera, Garcia architected and implemented secure and compliant Big Data infrastructures for customers in the financial services, healthcare and public sector industries to meet PCI, HIPAA, FERPA, FISMA and EU data security requirements. He was also the chief architect of the Gazzang zNcrypt product and is author of two patents for data security.

Rating: Everyone
Viewed 7 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

President Barack Obama talks about the importance of Big Data and Data Science, and introduces Dr. DJ Patil as the first ever Chief Data Scientist and Deputy Chief Technology Officer for Data Policy.

Dr. Patil will work with the Office of Science and Technology Policy.

Rating: Everyone
Viewed 1 time
Recorded at: September 29, 2015
Date Posted: February 23, 2016

From the 2015 Strata + Hadoop World in San Jose, Solomon Hsiang's keynote address, "The Emerging Age of Data-Driven Policy Design: Examples from Trying to Manage the Global Climate".

Advances in data science empower leaders to make better decisions for society. By using new kinds of information unavailable during the last several millennia of government, we can avoid mistakes of the past. We will discuss how data and statistical inference are informing how we manage the global climate rationally, a defining policy challenge for our generation.

About Solomon Hsiang (UC Berkeley):
Solomon Hsiang combines data with mathematical models to understand how society and the environment influence one another. In particular, he focuses on how policy can encourage economic development while managing the global climate. His research has been published in Science, Nature, and the Proceedings of the National Academy of Sciences.

Hsiang earned a BS in Earth, Atmospheric and Planetary Science and a BS in Urban Studies and Planning from the Massachusetts Institute of Technology, and he received a PhD in Sustainable Development from Columbia University. He was a Post-Doctoral Fellow in Applied Econometrics at the National Bureau of Economic Research (NBER) and a Post-Doctoral Fellow in Science, Technology and Environmental Policy at Princeton University. Hsiang is currently an Assistant Professor of Public Policy at the University of California, Berkeley and a Faculty Research Fellow at the NBER.

In 2013, Hsiang became the inaugural recipient of the American Geophysical Union’s Science for Solutions Award for “significant contributions in the application and use of Earth and space sciences to solve societal problems”.

In 2014, Hsiang was named in Forbes Magazine’s 30 Under 30 in Law and Policy.

Rating: Everyone
Viewed 1 time
Recorded at: September 29, 2015
Date Posted: February 23, 2016

From the 2015 Strata + Hadoop World in San Jose, Adam Kocoloski's keynote address, "A Bigger Lens Through which to View the World- the IBM Twitter Alliance".

It’s about more than just big data – it’s about how computers and people can partner to enhance and scale human expertise. In a landmark partnership announced in October, IBM and Twitter are combining advances in analytics, cloud and cognitive computing in a manner that has the potential to transform how institutions understand customers, markets and trends. Twitter represents the public pulse of the planet, and for the first time, IBM is delivering systems with the scale and sophistication to put that information to work for all of us. In this short session, Adam Kocoloski, CTO of IBM Cloud Data Services and co-founder of Cloudant, will explain how, when it comes to gaining insights from data, the future is bright. We haven’t seen anything yet.

About Adam Kocoloski (IBM):
Adam is a Co-founder and CTO of Cloudant, and an IBM Distinguished Engineer. He is an Apache CouchDB developer, joining the project as one of the first ten committers, and the lead architect of a Dynamo-flavored clustering solution for CouchDB that serves as the core of Cloudant’s distributed data management platform. Adam’s record of open source contributions and thought leadership in the Erlang, NoSQL and web services communities includes numerous publications and presentations at conferences, including Erlang Factory, Strata and SPIN. He recently presented a keynote address at MongoDB World in New York City, where he announced Cloudant Query, a MongoDB-style declarative query language that creates a better developer experience by reducing the learning curve for Cloudant. Adam received his Ph.D. in Physics from MIT in 2010, where he studied the gluon’s contribution to the spin structure of the proton. He is a proud father to three beautiful girls.

Rating: Everyone
Viewed 2 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

Roughly every decade, some kind of military or enterprise technology makes its way into the mainstream: the personal computer; the consumer Internet; the mobile phone; the Internet of Things. What happens when Big Data turns into a consumer product? Strata chair Alistair Croll offers some speculation about what data will do to the way we live, love, work, and play.

About Alistair Croll (Solve For Interesting):
Alistair has been an entrepreneur, author, and public speaker for nearly 20 years. He’s worked on a variety of topics, from web performance, to big data, to cloud computing, to startups, in that time. In 2001, he co-founded web performance startup Coradiant (acquired by BMC in 2011), and since that time has also launched Rednod, CloudOps, Bitcurrent, Year One Labs, the Bitnorth conference, the International Startup Festival and several other early-stage companies.

Alistair is a chair for Strata + Hadoop World conferences, Techweb’s Cloud Connect, and the International Startup Festival. He’s written four books on analytics, technology, and entrepreneurship, including the best-selling Lean Analytics, which is being translated into eight languages. He lives in Montreal, Canada and tries to mitigate chronic ADD by writing about far too many things at Solve For Interesting.

Rating: Everyone
Viewed 4 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

As the Apache Spark userbase grows, the developer community is working to adapt it for ever-wider use cases. 2014 saw fast adoption of Spark in the enterprise and major improvements in its performance, scalability and standard libraries. In 2015, we want to make Spark accessible to a wider set of users, through new high-level APIs for data science: machine learning pipelines, data frames, and R language bindings. In addition, we are defining extension points to let Spark grow as a platform, making it easy to plug in data sources, algorithms, and external packages. Like all work on Spark, these APIs are designed to plug seamlessly into Spark applications, giving users a unified platform for streaming, batch and interactive data processing.

About Matei Zaharia (Databricks):
Matei Zaharia started the Spark project at UC Berkeley and is currently CTO of Databricks. He serves as Spark’s vice president at Apache. In spring 2015, he is also beginning an assistant professor position at MIT.

Rating: Everyone
Viewed 3 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

Data Science, where are we going? What impact can we expect? With a special introduction from President Barack Obama.

About DJ Patil:
Dr. DJ Patil has been the VP of Product at RelateIQ and the Data Scientist in Residence at Greylock Partners.

He has held a variety of roles in academia, industry, and government, including Chief Scientist, Chief Security Officer, and Head of Analytics and Data Teams at the LinkedIn Corporation. Additionally he has held a number of roles at Skype, PayPal, and eBay.

Rating: Everyone
Viewed 14 times
Recorded at: September 29, 2015
Date Posted: February 23, 2016

We are often told that the past holds lessons on how to approach the present, but we rarely look to older technologies for inspiration. Rarer still do we look at the historical experiences of less industrialized nations to teach us about the technological problems of today. In this talk, Eden Medina shares her research on Project Cybersyn, an early computer network built in Chile to further a political project for revolutionary change. She uses this history to outline important lessons for future data-driven technologies.

About Eden Medina (Indiana University, Bloomington):
Eden Medina is Associate Professor of Informatics and Computing and Director of the Rob Kling Center for Social Informatics at Indiana University, Bloomington. Her research uses technology as a means to understand historical processes and she brings together studies of technology, history, and law in her writings. Her work studies how political projects shape the design, function, and use of computer systems, how computers have been used historically to bring about structural changes in society, and how political innovation can spur technological innovation.

Medina holds a B.S.E. in Electrical Engineering from Princeton University, a Master’s in Studies of Law from Yale Law School, and a Ph.D. in the History and Social Study of Science and Technology from MIT. Her first book, Cybernetic Revolutionaries: Technology and Politics in Allende’s Chile (MIT Press) received the Edelstein Prize for outstanding book in the history of technology and the Computer History Museum Prize for outstanding book in the history of computing. She is also the recipient of the IEEE Life Member’s Prize in Electrical History awarded by the Society for the History of Technology. In 2014, she published the co-edited volume Beyond Imported Magic: Essays on Science, Technology and Society in Latin America (MIT Press).