Felipe González-Pizarro

MSc. Computer Science graduate from UBC

I am a new NLP MSc. graduate from the University of British Columbia in Vancouver, Canada. I am seeking full-time positions in Data Science and Machine Learning in North America

You can find my resume here

Let's get in touch!

Email: fpeandrees[at]gmail.com

Technical Knowledge

Natural Language Processing

Building and training ML models for NLP tasks (e.g., topic modeling, hate speech detection) using NLTK, spaCy, Gensim, and HuggingFace.

Multimodal Learning

Deep learning for vision and language using Pytorch. Focus on Transformers, VAEs, RNNs, GANs, LSTMs, CNNs, Difussion models.

Information Visualization

Skilled in designing interactive visualizations with D3.js, Plotly, Seaborn, and Matplotlib using effective design principles and techniques.

Human Computer Interaction

Conducting user studies, prototyping and designing effective interfaces. Performing statistical analysis with Python/R.

Education

MSc. in Computer Science

September 2021 - Expected 2023

The University of British Columbia (UBC)

Vancouver, Canada

Focus on Natural Language Processing, Multimodal Learning, and Information Visualization under the supervision of Dr. Giuseppe Carenini.
Relevant coursework: Multimodal Learning with Vision, Language and Sound (CPSC 532S), Commonsense Reasoning in Natural Language Processing (CPSC 532V), Discourse in NLP (CPSC 532G), Information Visualization (CPSC 547), Computational Linguistics (CPSC 503), Topics in Human-Computer Interaction (CPSC 554).
Average grade 95%. Canadian GPA: 4.0/4.0

MSc. in Computer Science

March 2018 - September 2021

Universidad Técnica Federico Santa María (UTFSM)

Santiago, Chile

Focus on Social computing using Natural Language Processing, Deep Learning and Data Visualization techniques. Average grade 91/100. Canadian GPA: 4.0/4.0.

Bs. in Computer Science and Engineering

March 2012 - February 2018

Universidad Técnica Federico Santa María (UTFSM)

Santiago, Chile

Focus on Information Retrieval, Software Engineering, and Project Management. Average grade 81/100. Canadian GPA: 3.7/4.0. Passed subjects: 66/66. (Best Graduated Student, Rank: 1/32)

Studied abroad at Politecnico di Milano in Milan, Italy from February-August 2016, gaining valuable experience and insights as a Computer Science student in an international setting.

Selected Research Experience

Visiting Researcher (AI Research Scientist)

June 2021 - August 2021

Max Planck Institute for Informatics

Saarbrücken, Germany

Researched the use of large pre-trained models to detect hateful imagery. Developed a method to identify Antisemitic/Islamophobic social media posts. A dataset of Antisemitic/Islamophobic images is now available to aid researchers in creating better hate speech detection models. Supervisor: Dr. Savvas Zannettou.

Visiting Researcher (NLP Scientist)

January 2020 - December 2020

Dalhousie University

Halifax, Canada

Developed "TopicVisExplorer", a web-based tool that allows refinement and comparison of topic models with a novel topic similarity metric and refinement operations. A large user study was conducted to validate its usefulness. Supervisors: Dr. Evangelos Milios , Dr. Fernando Paulovich.

Researcher Assistant (Data Scientist)

March 2018 - December 2019

University of Washington (USA) - Universidad Técnica Federico Santa María (Chile)

Santiago, Chile

Developed a novel methodology for cross-language comparison of social media text to understand information privacy perspectives across different speakers. Analyzed unstructured textual social media data and published two conference papers on international computer science conferences. Supervisors: Supervisors: Dr. Cecilia Aragon (UW) , Dr. Claudia López (UTFSM).

Publications

Understanding and Detecting Hateful Content using Contrastive Learning

[Paper]

Felipe González-Pizarro & Savvas Zannettou (To Appear). In International Conference on Web and Social Media (ICWSM), 11 pages. [Acceptance rate: 20%]

Diversity-Aware Coherence Loss for Improving Neural Topic Models

Raymond Li, Felipe González-Pizarro, Linzi Xing, Gabriel Murray & Giuseppe Carenini (Submitted). Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), 10 pages. [Acceptance rate: 31.4%]

TopicVisExplorer: An interactive visualization tool to refine and compare topic models

Felipe González-Pizarro, Claudia López, Evangelos Milios, Fernando Paulovich & Marcelo Mendoza (Submitted). In ACM Transactions on Interactive Intelligent Systems, 30 pages. [Impact factor: 2.47]

Inequalities in Computational Thinking among Incoming Students in a STEM Chilean University

Felipe González, Claudia López, Carlos Castro & Andrea Vasquez. (To appear). IEEE Latin America Transactions Journal, 7 pages. [Impact factor: 1.10]

Regional Differences in Information Privacy Concerns After the Facebook-Cambridge Analytica Data Scandal

[Paper]

Felipe González-Pizarro, Andrea Figueroa, Claudia López & Cecilia Aragon. In Computer Supported Cooperative Work (CSCW): The Journal of Collaborative Computing and Work Practices, 45 pages. [Impact factor: 1.912]

Information Privacy Opinions on Twitter: A Cross-Language Study

[Paper] [Poster]

Felipe González, Andrea Figueroa, Claudia López & Cecilia Aragon. (2019, November). In Conference Companion Publication of the 2019 on Computer Supported Cooperative Work and Social Computing,CSCW'19, Austin, TX (2019), 4 pages. [Acceptance rate: 31.2%]

Global Reactions to the Cambridge Analytica Scandal: A Cross-Language Social Media Study

[Paper] [Slides]

Felipe González, Yihan Yu, Andrea Figueroa, Claudia López & Cecilia Aragon. (2019, May). In Companion Proceedings of The 2019 World Wide Web Conference, WWW'19, San Francisco, CA(2019), 8 pages [Acceptance rate: 18%]

Development of Computational Thinking on high school students: A case study on Chile

[Paper (in Spanish)] [Slides (in Spanish)]

Felipe González, Claudia López & Carlos Castro. (2018, November). In 2018 37th International Conference of the Chilean Computer Science Society IEEE (SCCC'18), Santiago, Chile (2018), 8 pages. Best Paper Award.

Revealing differences in Data Privacy Perspectives using Inter-Language Social Media Data

[Paper], [Poster]

Felipe González, Yihan Yu, Claudia López & Cecilia Aragon. (2018, November). Accepted on “Latin America as a Place for CSCW” Research Workshop of the 21st ACM Conference on Computer-Supported Cooperative Work and Social Computing

Instructor

March 2018 - August 2019

Computer Programming (IWI-131)

Universidad Técnica Federico Santa María (UTFSM), Chile

Python programming classes for freshman engineering students. Best ranked teacher for three semesters in a row. Class material (in Spanish) is available here

2018 (one semester)

Data Analysis and Visualization with Python (TMVP 2018-1)

Universidad Técnica Federico Santa María (UTFSM), Chile

Data science classes for computer science & and engineering students. Developed a class curriculum, lesson plans, and instructions about how to manage data and create meaningful visualizations using Python, Pandas, Matplotlib, Seaborn and Plotly. Class material (in Spanish) is available here

Teaching Assistant

Advanced Methods for Human Computer Interaction (CPSC-444)

Jan-Apr 2023

The University of British Columbia (UBC)

Vancouver, Canada

I facilitate weekly workshops where students develop web/mobile app prototypes, conduct experiments/user studies, and receive constructive feedback from me. I also grade their reports to help them succeed. Supervisors: Dr. Izabelle Janzen

Topics in Computer Science - Natural Language Processing (CPSC-436N)

Jan-Dec 2022

The University of British Columbia (UBC)

Vancouver, Canada

Teach students how to analyze and apply fundamental NLP algorithms and techniques. Text representation (e.g, language models), NLP Applications (E.g Topic modeling), Natural language understanding and generation Supervisors: Dr. Giuseppe Carenini, Dr. Vered Shwartz

Basic Algorithms and Data Structures (CPSC-221)

Sep-Dec 2021

The University of British Columbia (UBC)

Vancouver, Canada

Teach students about design and analysis of basic algorithms and data structures; algorithm analysis methods, searching and sorting algorithms, graphs and concurrency. Supervisor: Dr. Cinda Heeren

Project Management and Software Engineering (INF-360, INF-228)

2017 (two semesters)

Universidad Técnica Federico Santa María (UTFSM)

Santiago, Chile

Promote management skills (planning, organization, direction and control) in students through the follow-up to their projects of the XXV Feria de Software (Software Fair). Compliance with requirements and design of prototypes are reviewed. Supervisors: Prof. Pedro Godoy, Prof. Sergio Murua

Data Structures (INF-152)

2014-2015 (four semesters)

Universidad Técnica Federico Santa María (UTFSM)

Santiago, Chile

Teach students about the C/ C++ programming language. Teach students about data structures like linked list, stacks, queues, heaps, hash tables, trees and graphs. Supervisor: Dr. Diego Arroyuelo

Information Systems (INF-270)

2014 (one semester)

Universidad Técnica Federico Santa María (UTFSM)

Santiago, Chile

Upload online resources and grade assignments & final projects. Supervisor: Katherine Rivera

Honors and Adwards

Computer Science Merit Scholarship

September 2021 - September 2023

The University of British Columbia (UBC)

Canada

Outstanding Graduate Student applicant. CAD 20,000 for living expenses.

The Emerging Leaders in the Americas Program (ELAP)

January 2020 - July 2020

EduCanada

Canada

ELAP Scholarship provides outstanding students from Latin America with short-term exchange opportunities for research in Canada at graduate levels. $7,300 for travel and living costs.

The Cornell, Maryland, Max Planck Pre-doctoral Research School 2019

August 2019

Max Planck Institute for Software Systems

Germany

Got selected to attend to the CMMRS 2019 to learn about cutting-edge research in computer science at the Max Planck Institute for Software Systems, Germany. Travel and living costs fully funded.

National MSc. Grant

March 2019 - March 2020

National Commission for Scientific and Technological Research (CONICYT)

Chile

Top 7% Applicant. $9500 for tuition and living expenses.

La Serena School for Data Science: Applied tools for data-driven sciences

August 2019

Association of Universities for Research in Astronomy (AURA)

Chile

Got selected to attend to "La Serena School Data Science". Intensive week of interdisciplinary lectures focused on applied tools for handling big astronomical datasets. Travel and living costs fully funded. Our work project "Planetary Disk Direct Imaging" is available here

Travel Grant

November 2019

Universidad Técnica Federico Santa María

Chile

Travel Grant to support my participation at the 22nd ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW 2019), Austin, TX, United States.

Travel Grant

January 2019

Universidad Técnica Federico Santa María

Chile

Travel Grant to support my participation at the 2019 International World Wide Web Conference, San Francisco, United States

Incentive Program for Scientific Initiation (PIIC)

October 2018 - August 2019

Universidad Técnica Federico Santa María

Chile

$2600 for the research project: "Cultural Differences in Data Privacy Perspectives on Social Media".

Travel Grant

October 2018

Special Interest Group on Computer–Human Interaction (ACM SIGCHI)

United States

Travel Grant to support my participation at the "Latin America as a Place for CSCW Research" workshop and at the conference on Computer-Supported Cooperative Work and Social Computing (CSCW 2018).

Travel Grant

October 2018

Universidad Técnica Federico Santa María

Chile

Travel Grant to support my participation at the 35th International Conference of the Chilean Computer Science Society (SCCC 2018)

MSc. Tuition scholarship

March 2018 - August 2021

Universidad Técnica Federico Santa María

Chile

Full tuition fees coverage on the MSc. Computer Science program

MSc. UTFSM scholarship

March 2018 - March 2020

Universidad Técnica Federico Santa María

Chile

$6700 for living expenses

Santander International Mobility Program

February 2016 - August 2016

Santander Bank

Italy

$5000 for travel and living expenses during a exchange experience at Politecnico di Milano, Milan, Italy.

Selected Projects

Visual Commonsense Generation & its incorporation into a Multimodal Topic Modeling algorithm

[Report] [Slides]

Proposing a general-purpose visual commonsense generation model, VisualCOMET+, for the creation of structured commonsense triplets that can be used for downstream tasks. Also showing the success of the model in generating coherent and diverse topics for multimodal topic modeling.

Contextualized Topic Models with Commonsense Knowledge

[Report] [Slides]

Incorporating commonsense knowledge into CTM was explored using COMET and ConceptNet NumberBatch embeddings. Clustering was also investigated as an alternative to LDA-inspired techniques, with a discussion of the trade-off between the two and motivation to incorporate corpus-level semantic relationships into neural topic models.

Discovering Interpretable Topics in Multimodal Data by Leveraging CommonSense Knowledge

[Report] [Slides]

A new topic modeling algorithm that incorporates commonsense knowledge to identify coherent topics in multimodal data is proposed. The algorithm was trained and evaluated on a dataset of 100K Antisemitic/Islamophobic posts from 4chan, demonstrating that injecting commonsense knowledge can improve the quality of topics generated by the algorithm.

Topic modeling User-Adaptive system

[Report] [Slides]

Introducing an extended version of MultiModalTopicExplorer, a topic modeling visualization tool that allows users to identify and explore topics that align with their preferences and understanding of the domain. Results from a user study demonstrate improved task success rates and reduced physical and mental demands when using the user-adaptive version of the tool.

MultimodalTopicExplorer: A Visual Text Analytics System for Exploring a Collection of Multimodal Online Conversations

[Report] [Slides]

MultiModalTopicExplorer is an interactive visualization tool for topic modeling that addresses the limitations of current visualizations by allowing users to perform a qualitative analysis of results and displaying relevant images for each topic. A user study showed positive results in terms of task success.