Acquired a Ph.D degree in 2002 (X-ray spectroscopy) and served as a researcher in Samsung Electronics, Huawei, HGST, VmWare, ByteDance and IBM. As a Linux kernel developer maintains and contributes to HFS/HFS+, NILFS2, CephFS file system drivers, and designed a SSDFS open-source file system and ML library in Linux kernel. Research interests include file systems and data storage design, neuromorphic computing, data-centric and memory-centric computing, cognitive computing, and quantum computing. Around 80 patents have been granted and 5 papers have been published.
File system is a fundamental and crucial technology of managing a digital data. But current file system stack introduces multiple drawbacks. Machine Learning (ML) technologies need to be adopted in file system stack and AI agents become a new customer of digital data. Cognitive file system concept could enhance the old and robust file system technologies with the goal of managing data more efficiently and satisfying new types of customers. Cognition feature implies that file system can “recognize” repeatable patterns and relations in raw data streams. User can still store the raw data in streams/files but without the necessity to give them names. Cognition subsystem can detect repeatable patterns in raw data streams and build a dictionary. Patterns in dictionary can work as keywords that can build relations among data streams likewise a relational database. Cognitive file system concept can build completely new technological foundation for computation offloading in storage space, adopting ML models for data analysis/processing, and provide a flexible interface for more efficient interaction among AI agents and data.