Tuesday, March 6, 2018

2018/2/23 Event Record



Taiwanese Data Professionals (TDP) and Cafe Philo at New York were honored to invite James Powell and Robin Lee to speak about their experiences in organizing data communities and building reproducible data workflows.



James kicked off the evening by sharing his thoughts on the importance of building a community. He has been a rock star in organizing conferences and meetup groups related to Python and data science. So why build a community? Why bother volunteering? According to James, building a community is about collective action: doing something greater than what you can do with your own resources. By helping good people succeed, you motivate them to do the same for others in the future, and in the long term you help the whole community succeed. For James, building a community is ultimately an expression of optimism about society: through collective action, we create the world we want to live in.



Robin’s talk, on the other hand, focused on reproducible, auditable, accurate, and collaborative SQL workflows. As analysts or data scientists who work with data every day, many of us have made mistakes when processing data. However, instead of blaming individuals for their personal failures, it is more important to change that mindset and examine how the system has failed. It is only by learning from mistakes and establishing a reliable, reproducible workflow that we become better data scientists and analysts day by day. That is why concepts such as modular code, dependency management, and executable analysis scripts matter in our everyday work. He then shared an example SQL workflow using Hacker News data.
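
To make those three concepts a little more concrete, here is a minimal sketch in Python with SQLite. It is not the workflow Robin presented: the table names, the sample rows, and the `run_pipeline` helper are all made up for illustration. Each SQL step is a small module that declares what it depends on, and one script rebuilds everything from scratch in dependency order.

```python
import sqlite3
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical pipeline: each step is one small SQL module plus the steps it
# depends on. The steps are listed out of order on purpose; the runner below
# works out the correct order from the declared dependencies.
STEPS = {
    "top_stories": {
        "deps": ["clean_stories"],
        "sql": """
            CREATE TABLE top_stories AS
            SELECT id, title, score FROM clean_stories WHERE score >= 100;
        """,
    },
    "clean_stories": {
        "deps": ["raw_stories"],
        "sql": """
            CREATE TABLE clean_stories AS
            SELECT id, TRIM(title) AS title, score
            FROM raw_stories
            WHERE score IS NOT NULL;
        """,
    },
    "raw_stories": {
        "deps": [],
        # Made-up sample rows standing in for a real data load.
        "sql": """
            CREATE TABLE raw_stories (id INTEGER, title TEXT, score INTEGER);
            INSERT INTO raw_stories VALUES
                (1, '  Show HN: A toy project ', 150),
                (2, 'Ask HN: Example question', 42),
                (3, 'Untitled', NULL),
                (4, 'Example launch post', 310);
        """,
    },
}


def run_pipeline(conn, steps):
    """Run every step after the steps it depends on; return the order used."""
    graph = {name: set(step["deps"]) for name, step in steps.items()}
    order = list(TopologicalSorter(graph).static_order())
    for name in order:
        conn.executescript(steps[name]["sql"])
    return order


# An in-memory database means every run starts from a clean slate.
conn = sqlite3.connect(":memory:")
order = run_pipeline(conn, STEPS)
rows = conn.execute(
    "SELECT title, score FROM top_stories ORDER BY score DESC"
).fetchall()
print(order)
print(rows)
```

Because the whole analysis lives in one executable script, a teammate can rerun it end to end and audit each step on its own, rather than relying on queries run by hand in some remembered order.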
