Project Idea Description
Topics: Storage systems, machine learning
Skills: Python, PyTorch, Bash scripting, Linux, Machine Learning modeling
Difficulty: Hard
Size: Large (350 hours)
Mentors: Yuyang (Roy) Huang (primary contact), Swami Sundararaman
Contributor(s): Lihaowen (Jayce) Zhu

In the realm of machine learning (ML), preprocessing of data is a critical yet often underappreciated phase, consuming approximately 80% of the time in common ML tasks.