CSE Colloquium: Machine Learning with Limited Compute and Data Resources

Zoom Information Link 

Topic: Qian Yang Faculty Candidate Colloquium 

Time: Mar 17, 2020 10:30 AM Eastern Time (US and Canada) 

Join from PC, Mac, Linux, iOS or Android: https://psu.zoom.us/j/696372512 

Or iPhone one-tap (US Toll): +16468769923,696372512# or +16699006833,696372512# 

Or Telephone: 

Dial: 

+1 646 876 9923 (US Toll) 

+1 669 900 6833 (US Toll) 

Meeting ID: 696 372 512 

International numbers available: https://psu.zoom.us/u/adBsnTPXaN 

Abstract: Recent innovations in artificial intelligence and machine learning not only excel at numerous tasks in industry and academia but also keep impacting our daily lives. With the power of large datasets and the availability of compute resources such as graphics processing units (GPUs) and tensor processing units (TPUs), machine learning models have grown with increasingly better performance. However, their success relies heavily on the surge of big computing and big data. There remain challenges emerging in the real world given limited resources. This talk describes a series of new machine learning algorithms that draw on utilizing the model and data efficiently and effectively given limited resources. 

For the challenges raised by limited compute resources, which is still an underexplored research area, this talk will present the first model-parallel algorithm to parallelize the training of Transformer-based language models with over a billion parameters. This talk will also consider platforms with limited memory and computation ability, such as mobile devices, and introduce different strategies to transform continuous and generic sentence embeddings into a binarized form, while preserving their rich semantic information. At the same time, in terms of limited data resources, recent success in natural language processing (NLP) tasks has been achieved for popular languages like English that have text corpora of hundreds of millions of words. However, these are only about 20 languages out of approximately 7,000 languages in the world. For low-resource language pairs, this talk will demonstrate a joint training algorithm for pivot-based neural machine translation by bridging source and target languages with a pivot language. This talk will also propose the first end-to-end conditional generative model for generating paraphrases via adversarial training. Finally, I will discuss applications of machine learning techniques in interdisciplinary research, including biomedical data science, human-computer interaction, and education. 

Biography: Qian (Lara) Yang is a postdoctoral researcher at Duke University, working with Lawrence Carin. Before that, she worked with Rebecca J. Passonneau at both Columbia University and Pennsylvania State University, and Hermann Ney at RWTH Aachen University in Germany. She received her Ph.D. in Computer Science and Technology from Tsinghua University in 2017, advised by Gerard de Melo. Her research focuses on a wide range of topics in Natural Language Processing, Machine Learning, and the intersection of Machine Learning with Biomedical Data Science, Education, and Human-Computer Interaction. She is the recipient of the Educational Advances in Artificial Intelligence (EAAI) New and Future AI Educator award (funded by grants from NSF, AIJ, AAAI), the Google Women Techmakers Scholarship, an RWTH – Tsinghua Research Fellowship (provided by DAAD in Germany), and nominated for the 2019 Outstanding Postdoc at Duke University award. She has served as a session chair at AAAI and on program committees for major conferences such as NeurIPS, ICML, ACL, EMNLP, NAACL, COLING, AAAI, IJCAI, and KDD. 

 

Share this event

facebook linked in twitter email

Media Contact: Rebecca Passonneau

 
 

About

The School of Electrical Engineering and Computer Science was created in the spring of 2015 to allow greater access to courses offered by both departments for undergraduate and graduate students in exciting collaborative research fields.

We offer B.S. degrees in electrical engineering, computer science, computer engineering and data science and graduate degrees (master's degrees and Ph.D.'s) in electrical engineering and computer science and engineering. EECS focuses on the convergence of technologies and disciplines to meet today’s industrial demands.

School of Electrical Engineering and Computer Science

The Pennsylvania State University

207 Electrical Engineering West

University Park, PA 16802

814-863-6740

Department of Computer Science and Engineering

814-865-9505

Department of Electrical Engineering

814-865-7667