- This event has passed.
Livestream: Visualizing ArXiv.org with PyGraphistry
September 7 @ 8:30 pm - 9:30 pm
NOTE: Please sign-in at via the Livestream link below. Thank you!
ArXiv.org is an amazing source for freely available research papers across a variety of scientific/quantitative disciplines. With millions of papers from Physics to Biology, or Computer Science to Mathematics, ArXiv provides a rich resource for exploring cutting edge research topics over time. Rather than on the papers’ content, however, the focus of this talk will be on the relationships between them. Specifically, we will use the references to other ArXiv papers in the citations to build a giant network graph of ArXiv publications.
The talk will consist mainly of 2 pieces:
(1) Retrieving and parsing citations from ArXiv.org and
(2) Building, analyzing, and visualizing the huge network graph in cool ways using (Py)Graphistry (https://www.graphistry.com/).
Graphistry is an awesome tool for visualizing large-scale graphs. With just a few lines of Python code you can interactively explore a massive network graph of ArXiv papers. With this you can see so many things in an instant! Communities forming, landmark papers, and the changing landscape of scientific research over time (plus more!) are all quickly discoverable from this graph.
About the Speaker
Paul Burkard comes to data science as a physics convert. He studied physics and mathematics at MIT before taking a job with a government intelligence contractor. There he entered the realm of data science doing groundbreaking work with large scale text analytics via a technology called Latent Semantic Indexing (LSI). In this context, LSI allowed analysts to tease out deep yet subtle relationships between entities, concepts, etc. from hundreds of millions of raw text documents. After receiving a Master’s in Machine Learning, he pursued a more mainstream data science role and spent the last few years leading a team building models on user and advertising data for over nearly a billion users with Hadoop and AWS. During this time, he also taught night courses for professionals aspiring to be data scientists. Paul is a huge sports fan, and also loves travel and learning most anything new.
Metis (thisismetis.com) accelerates careers in data science by providing full-time immersive bootcamps, evening part-time professional development courses, online resources, and corporate programs based in Seattle, New York, Chicago, and San Francisco.
Brought to you by Kaplan, Metis focuses primarily on Python, machine learning, data visualization, deep learning, big data processing, statistical foundations, and more. Students and alumni of the bootcamp program receive continuous support from our career advisors, empowering them to pursue a successful career in the fast-growing field of data science.
Learn more about us at https://thisismetis.c….
Sign up for Demystifying Data Science: A Free Online Conference:
Join our Metis Community Slack channel! Apply here: https://bit.ly/metis-…
Metis Code of Conduct
Metis is dedicated to providing a harassment-free experience for everyone, regardless of gender identity, age, sexual orientation, disability, physical appearance, body size, race, or religion (or lack thereof).
We do not tolerate harassment of students, staff, or visitors in any form. Sexual language and imagery is not appropriate for any event including talks, workshops, parties, and other online media. Individuals and groups that do not abide by these rules will be asked to leave and, if necessary, prohibited from future events.
If you have any questions or you’re made to feel uncomfortable by anyone on our campus or at one of our offsite events, please let one of the staff members know right away. The matter will be taken seriously and promptly addressed.