Tianyu Zhang 张天宇

Tianyu Zhang 张天宇

Undergraduate AI Major. Dreamer. Doer.

Carnegie Mellon University

ByteDance IES

Biography

I am a junior at Carnegie Mellon University School of Computer Science. I major in Artificial Intelligence at SCS and minor in Computational Finance at MCS.

I started off interested in computer vision tasks and specifically re-identification. I helped build ML models in the baggage re-id and tracking system for Smart Security Inspection deployed at Beijing Daxing Int’l Airport at Nuctech summer 2019.

My current interest lies at the intersection of machine learning and scalable, distributed systems. I’m joining TikTok’s recommender systems team at ByteDance summer 2020 as an algorithm intern to obtain hands-on experience in devising ML algorithms and systems for billion-DAU products.

Besides CS, I’m an Economics hobbyist. I love to read about public choice and market theory. Academics aside, I’m into Chinese folk music; I’ve been a passionate Erhu player for over 14 years. I also love snowboarding and I’ve slid down mountains in Breckenridge and Chongli. I look forward to more quests in the mountains in the future.

Interests

  • Scalable/Distributed ML Systems
  • Computer Vision
  • Person/Object Re-identification
  • Learning Theory
  • Interpretable ML

Education

  • B.S. in Artificial Intelligence, Expected 2022

    Carnegie Mellon University

  • High School Diploma, 2018

    Beijing National Day School & Wasatch Academy

Languages & Tools

with self-evaluated levels of expertise

Python

Advanced

Java

Advanced

c

C/C++

Advanced

JavaScript

Intermediate

matlab

MATLAB

Intermediate

pytorch

Pytorch

Intermediate

spark

Spark

Beginner

React

Beginner

Node.js

Beginner

Experience

 
 
 
 
 

Machine Learning Algorithm Intern

ByteDance

Jun 2020 – Aug 2020 Beijing
(On-going) Improving recommendation algorithms for Douyin/TikTok.
 
 
 
 
 

Machine Learning Intern

Nuctech

May 2019 – Jul 2019 Beijing

Baggage Re-Identification for Smart Security Inspection: Extensively investigated past architectures for person and vehicle re-identification (re-ID) tasks. Implemented multiple architectures and evaluated their performance for baggage re-ID. Achieved a 0.76 accuracy of CMC rank-1 (improving the baseline by a 0.34 margin in accuracy and about 100x in speed) on the overall re-ID task (image retrieval from the 500-baggage gallery) on the Multi-View Baggage dataset.

Testing Framework Development: Developed a fork upon the open-source deep-person-reid framework for the team’s future research and deployment workflow, including features such as activation visualization, training checkpoint management, Comet.ml integration, and CLI tools.

 
 
 
 
 

Head Academic Staff

TechX Academy

May 2018 – Aug 2019 Shanghai

Instructor of Theory of Computation: Co-taught the 5-day seminar on theoretical computer science, covering topics such as discrete math fundamentals, computability (DFA and TM), and efficiency (polynomial reduction and P vs. NP). Designed slides, handouts, and assignments.

TA for AI Courses: TA-ed 10-day main courses. Designed homework and assessments for neural net basics, Markov Decision Process, Q-learning in Deep Reinforcement Learning (2018), and feature extraction, SVM, DP & Seam Carving in Computer Vision (2019). Led Q&A sessions and graded homework.

Academy Experience Design: Directed the Academic Team of TechX 2019. Led the processes of course design, staff search, and infrastructure setup for 5 courses.

 
 
 
 
 

Co-founder

Fintern

Apr 2018 – Apr 2019 Beijing

Product Management: Assisted the development of the platform from scratch using TypeScript and Node.js; generated user experience reports and feature addition proposals; beta-tested the platform with 100 users.

Business Operations: Drafted business plans and pitched to Tsinghua alumni investors, ZhenFund, and various other VCs; Invited 15+ Chinese corporations and organizations to enter Fintern.

Relevant Courses

Courses I’ve taken over the years. Items marked * are on-going.

Computer Science

  • Algorithms Design and Analysis*
  • Parallel and Sequential Data Structures and Algorithms
  • Introduction to Computer Systems
  • Great Ideas in Theoretical Computer Science
  • Functional Programming
  • Principles of Software Construction
  • Competitive Programming

Artificial Intelligence

  • Scalable Machine Learning*
  • Computer Vision*
  • Introduction to Machine Learning
  • AI: Representation and Problem Solving

Mathematics

  • Multivariate Calculus
  • Probability Theory
  • Linear Algebra
  • Discrete Math
  • Modern Regression*
  • Introduction to Mathematical Finance

Projects

More to come…

Time Series Analysis Framework

This framework is aimed to provide analytical tools and operations on time series from any domain and allows plugins to be written to import time series from custom sources and visualize them in custom ways. Written in Java with GUI constructed with Java Swing.

Malloc (Dynamic Memory Allocator)

Pinnacle of Intro to Comp Sys. A dynamic memory allocator with segregated list implementation. Supports 64-bit address space and achieves 74.6% average utilization.

Carcassonne (Board Game)

A multi-player tile-laying board game. GUI constructed with Java Swing.

Small Assembly Interpreter

Parses and simulates execution of Small Assembly (a 26-instruction subset of the standard Assembly language) on a virtual machine. Written in Java with an extensible design.

deep-person-reid MVB Fork

Deep learning re-identification in PyTorch. A forked version with additional functionalities and MVB interface.

MVB ReID with Verification Scheme

Models and experiments for the baggage re-id task using the verification scheme.

Microbiome

K-mer search enabled fast bacterial DNA sequence retrieval with DNA sequence analysis of Pittsburgh rivers.

Pop

A group-based crowdsourcing notification system. Never miss out on an impromptu event again! Built at CMU TartanHacks 2019.

VexBoard

A notepad built from Python Flask.

City Selection for Int’l Conferences

Venue Selection of International Conferences Based on Time Difference and Flight Fatigue: Explored the influence of jet lag and flight fatigue on travelers; built models to provide strategies for international conference venue selection. (IMMC2017, Finalist, MATLAB/Mathematica)

Language Distributions Prediction

Predicting the Distributions of Language Speakers and Its Application in Office Site Selections: Borrowing from the SIR model from epidemiology and the gravitational model from physics, built a model to analyze the spread of languages; utilized AHP to analyze and offer recommendations of global office setup for international corporations. (MCM2018, Meritorious, Python/MATLAB)

Recent & Upcoming Talks

The Theory of Computation

Understand the theoretical underpinnings of computer science.
The Theory of Computation

Not a Typical Talk on Recommender Systems

Understand recommender systems from a less technical perspective
Not a Typical Talk on Recommender Systems

Contact

Let’s talk about anything :)