PyData Berlin 2017

June 30th to July 2nd

Diversity Program

PyData is community of users and developers of python data tools. We firmly believe that the community becomes better with each new member. We welcome new member from diverse backgrounds and seek to actively aid people, especially people new to community, to enjoy their time at PyData.

In light of that PyData runs a diversity program to encourage new speakers by offering mentorship on talk and conference prepartion. The diversity program also includes limited number of scholarships for people who might not otherwise be aable to attend the conference. In 2017 we were able offer free or reduced priced tickets as well as travel reimbursements to 20 scholarships awardees. Focusing our efforts on the local community within the EU, allowed us to maximise the number of people we could support. The scholarship was 100% supported by sponsors and this year we received a generous donation from the Python Software Foundation, who have been a consistent supporter of PyData conferences around the world.

Python Software Foundation

You can read about Szilvia Téglás’ and Katinka Jeszenszki’s reflections on their first PyData here.

Impressions from the Conference

PyData Berlin 2017

Talks

55:57
Ethics in Machine Learning Panel
32:39
Ethics in Machine Learning Panel Q&A
54:44
The Future of Cybersecurity Needs You, Here is Why.
Verónica Valeros | Keynote
46:02
Keynote Q&A
Verónica Valeros
58:32
Are many of your worries about AI wrong?
Toby Walsh | Keynote
41:36
Keynote Q&A
Toby Walsh
55:25
Natural Language Processing: Challenges and Next Frontiers
Barbara Plank | Keynote
30:58
Keynote Q&A
Barbara Plank
32:46
Data Science & Data Visualization in Python
Radovan Kavicky
1:26:55
Best Practices for Debugging
Dr. Kristian Rother
23:45
Engage the Hyper-Python
Daniele Rapati
37:36
Where are we looking? Predicting human gaze using deep networks.
Oliver Eberle
40:26
Text similiarity with the next generation of word embeddings in Gensim
Lev Konstantinovskiy
34:56
Analysing user comments with Doc2Vec and Machine Learning classification
Robert Meyer
35:45
Compositional distributional semantics for modelling natural language
Thomas Kober
29:56
Conversational AI: Building clever chatbots
Tom Bocklisch
38:37
Semi-Supervised Bootstrapping of Relationship Extractors
David Soares Batista
33:45
Is That a Duplicate Quora Question?
Abhishek Thakur
38:49
Developments in Test-Driven Data Analysis
Nick Radcliffe
45:05
Evaluating Topic Models
Matti Lyra
32:13
A word is worth a thousand pictures: Convolutional methods for text
Tal Perry
36:25
Gold standard data: lessons from the trenches
Miroslav Batchkarov
57:46
The path between developing and serving machine learning models.
Adrin Jalali
1:24:37
Advanced Metaphors in Coding with Python
James Powell
1:44:00
Leveling up your Jupyter notebook skills
Gerrit Gruben
1:26:31
Introduction to Julia for Scientific Computing and Data Science
David HigginsRobert Schwarz
1:17:26
Pandas from the Inside / "Big Pandas"
Stephen Simmons
1:24:29
Introduction to Data-Analysis with Pandas
Alexander Hendorf
48:26
Topic Modelling (and more) with NLP framework Gensim
Bhargav Srinivasa Desikan
25:48
Social Networks and Protest Participation: Evidence from 130 Million Twitter Users
Jonathan Ronen
36:36
Data Science for Digital Humanities: Extracting meaning from Images and Text
Hendrik Heuer
36:08
AI assisted creativity
Roelof Pieters
30:00
Spying on my Network for a Day: Data Analysis for Networks
Aisha Bello
37:48
Biases are bugs: algorithm fairness and machine learning ethics
Françoise Provencher
42:24
Fairness and transparency in machine learning: Tools and techniques
Andreas Dewes
38:04
Deep Learning for detection on a phone
Irina Vidal Migallon
37:50
Towards Pythonic Innovation in Recommender Systems
Carlotta Schatten
42:54
“Which car fits my life?” - mobile.de’s approach to recommendations
Florian WilhelmArnab Dutta
35:35
On Bandits, Bayes, and swipes: gamification of search
Stefan Otte
33:12
Machine Learning to moderate ads
Vaibhav SinghJaroslaw Szymczak
32:09
Large Scale Vandalism Detection in Knowledge Bases
Alexey Grigorev
44:42
Open Data Use Cases
Ulrike Thalheim
39:17
Clean Code in Jupyter notebooks, using Python
Volodymyr (Vlad) Kazantsev
35:18
Fast Multidimensional Signal Processing using Julia with Shearlab.jl
Héctor Andrade Loarca
38:07
Polynomial Chaos: A technique for modeling uncertainty
Emily Gorcenski
39:47
TNaaS - Tech Names as a Service
Vincent D. Warmerdam
38:05
Finding Lane Lines for Self Driving Cars
Ross Kippenbrock
39:02
Kraft - Building smart IoT applications with Python and Spark
Rafael Schultze
38:03
Introduction to Search
Sirin Odrowski
37:23
Patterns for Collaboration between Data Scientists And Software Engineers
Karolina Alexiou
28:28
Kickstarting projects with Cookiecutter
Raphael Pierzina
30:03
Patsy: The Lingua Franca to and from R
Max Humber
1:10:26
fai Chow - Introduction to Machine Learning with H2O and Python
Jo
1:23:37
Introductory tutorial on data exploration and statistical models
Alexandru Agachi
39:10
Lightning Talks
49:15
Blockchains for Artificial Intelligence
Trent McConaghy