Research
My research interests include system infrastructure for
big data processing, such as file and storage system, data management system,
and I/O systems in high-performance computing.
|
Student
Recruiting
There are GRA
openings for Ph.D students in Dr. Jiang’s group. Interested students shall
contact him by emailing their CVs and TOFEL/GRE scores.
Representative
and Recent Publications (see also Full
publication list)
·
Chen Zhong, Prajwal
Challa, Xingsheng Zhao, Song Jiang, “Buffered Hash Table: Leveraging
DRAM to Enhance Hash Indexes in the Persistent Memory”, in Proceedings of the 11th IEEE Non-Volatile
Memory Systems and Applications Symposium (NVMSA’22), August, 2022.
Taipei, Taiwan.
·
Lingfeng Xiang,
Xingsheng Zhao, Jia Rao, Song Jiang, Hong Jiang, “Characterizing the Performance of Intel
Optane Persistent Memory: a Close Look at its On-DIMM Buffering”, in
Proceedings of the European Conference on Computer Systems (EuroSys'22),
March, 2022.
- Chen
Zhong, Xingsheng Zhao, and Song
Jiang, "LIRS2: An Improved LIRS Replacement
Algorithm", in Proceedings of the
14th ACM International Systems and Storage Conference (SYSTOR'21),
June, 2021.
- Wenhui
Zhang, Xingsheng Zhao, Song Jiang, and Hong Jiang, “ChameleonDB: a Key-value Store
for Optane Persistent Memory”, in
Proceedings of the European Conference on Computer Systems (EuroSys'21),
April, 2021.
- Xingsheng
Zhao, Song Jiang, and Xingbo Wu, “WipDB: A Write-in-place Key-value
Store that Mimics Bucket Sort”, in
Proceedings of the 37th IEEE International Conference on Data Engineering
(ICDE’21), April, 2021.
- Wenshao
Zhong, Chen Chen, Xingbo Wu, and Song Jiang, “REMIX: Efficient Range Query for
LSM-trees”, in Proceedings of the 19th USENIX
Conference on File and Storage Technologies (FAST’21), February, 2021.
- Fan Ni and Song Jiang,
"RapidCDC: Leveraging
Duplicate Locality to Accelerate Chunking in CDC-based Deduplication
Systems", in Proceedings of 2019 ACM Symposium on Cloud Computing
(ACM SoCC'19), Santa Cruz, CA, November, 2019.
- Fan Ni, Xingbo Wu, Weijun Li, Lei Wang, and
Song Jiang, “Leveraging SSD's
Flexible Address Mapping to Accelerate Data Copy Operations”,
in Proceedings of the IEEE 21st International Conference on High
Performance Computing and Communications; (HPCC-2019), Zhangjiajie,
China, August, 2019.
- Fan Ni, Song
Jiang, Hong Jiang, Xingbo Wu, and Jian Huang, “SDC: A Software Defined Cache Supporting
Flexible Key-value-style Data Caching”,
in Proceedings of the 33th ACM International Conference on Supercomputing
(ICS'19), Phoenix, AZ, June, 2019.
- Fan Ni, Xing
Lin, and Song Jiang, “SS-CDC: A Two-stage Parallel Content-Defined
Chunking for Deduplicating Backup Storage",
in Proceedings of the 12th ACM International Systems and Storage
Conference (SYSTOR'19), Haifa, Israel, June, 2019.
- Zhuo Huang,
Song Wu, Song Jiang, and Hai Jin, “FastBuild: Accelerating Docker
Image Building for Efficient Development and Deployment of Containers”,
in Proceedings of 35th IEEE Symposium on Massive Storage Systems and
Technologies (MSST'19),
Santa
Clara, CA, May, 2019.
- Xingbo Wu,
Fan Ni, and Song Jiang, "Wormhole: A Fast Ordered Index for
In-memory Data Management", in Proceedings of the European
Conference on Computer Systems (EuroSys'19), Dresden, Germany,
March, 2019. [Source Code]
- Fan Ni,
Xingbo Wu, Weijun Li, Lei Wang, and Song Jiang. “WOJ: Enabling
Write-Once Full-data Journaling in SSDs by Using Weak-Hashing-based
Deduplication”. in Proceedings of the 36th
International Symposium on Computer Performance, Modeling, Measurements
and Evaluation 2018 (IFIP
Performance 2018), Toulouse, France, December 2018.
- Fan Ni, Xingbo Wu, Weijun Li, and Song
Jiang. “ThinDedup: An
I/O Deduplication Scheme that Minimizes Efficiency Loss due to Metadata
Writes”, in
Proceedings of the 37th IEEE International Performance Computing and
Communications Conference (IPCCC'18),
Orlando, Florida, November, 2018. (Best
Paper Candidate)
- Xingbo Wu, Fan Ni, and Song Jiang,
"Search Lookaside Buffer:
Efficient Caching for Index Data Structures", in Proceedings of 2017
ACM Symposium on Cloud Computing (ACM SoCC'17), Santa Clara, CA,
September, 2017.
- Fan Guo,
Yongkun Li, Yinlong Xu, Song Jiang, and John C. S. Lui, "SmartMD: A High
Performance Deduplication Engine with Mixed Pages",
in Proceedings of 2017 USENIX Annual Technical Conference (USENIX
ATC'17), Santa Clara, CA, July, 2017.
- Yuanyuan
Sun, Yu Hua, Song Jiang, Qiuyu Li, Shunde Cao, and Pengfei Zuo,
"SmartCuckoo: A Fast and
Cost-Efficient Hashing Index Scheme for Cloud Storage Systems",
in Proceedings of 2017 USENIX Annual Technical Conference (USENIX
ATC'17), Santa Clara, CA, July, 2017.
- Chunyi Liu,
Fan Ni, Xingbo Wu, Xiao Zhang, and Song Jiang, "Freewrite:
Creating (Almost) Zero-Cost Writes to SSD in Applications",
in Proceedings of the 10th ACM International Systems and Storage
Conference (SYSTOR'17), Haifa, Israel, May, 2017.
- Xingbo Wu,
Fan Ni, Li Zhang, Yandong Wang, Yufei Ren, Michel Hack, Sili Shao, and Song
Jiang, "NVMcached: An
NVM-based Key-Value Cache", in Proceedings of the 7th ACM SIGOPS
Asia-Pacific Workshop on Systems (APSys'16), Hong Kong, China,
August, 2016.
- Guoyao Xu,
Cheng-Zhong Xu, and Song Jiang, "Prophet: Scheduling Executors with
Time-varying Resource Demands on Data-Parallel Computation Frameworks",
in Proceedings of the 13th IEEE International Conference on Autonomic
Computing (ICAC'16), Wuerzburg, Germany, July, 2016.
- Xingbo Wu,
Li Zhang, Yandong Wang, Yufei Ren, Michel Hack, and Song Jiang,
"zExpander: a Key-value
Cache with both High Performance and Fewer Misses", in
Proceedings of the European Conference on Computer Systems (EuroSys'16),
London, UK, April, 2016.
- Xingbo Wu,
Wenguang Wang, and Song Jiang, "TotalCOW:
Unleash the Power of Copy-On-Write for Thin-provisioned Containers",
in Proceedings of the 6th ACM SIGOPS Asia-Pacific Workshop on Systems (APSys'15),
Tokyo, Japan, July, 2015.
- Xingbo Wu,
Yuehai Xu, Zili Shao, and Song Jiang, "LSM-trie: An LSM-tree-based
Ultra-Large Key-Value Store for Small Data", in Proceedings of
2015 USENIX Annual Technical Conference (USENIX ATC'15), Santa
Clara, CA, July, 2015. [Source code of LSM-trie]
- Yuehai Xu,
Eitan Frachtenberg, and Song Jiang, "Building
a High-performance Key-value Cache as an Energy-efficient Appliance",
in Proceedings of the 32st International Symposium on Computer
Performance, Modeling, Measurement and Evaluation 2014 (IFIP
Performance'14), Turin, Italy, October, 2014. [Best Student Paper Award]
- Peng Wang,
Guangyu Sun, Song Jiang, Jian Ouyang, Shiding Lin, Chen Zhang, and
Jason Cong, "An Efficient
Design and Implementation of LSM-Tree based Key-Value Store on
Open-Channel SSD", in Proceedings of European Conference on
Computer Systems (EuroSys'14), Amsterdam, the Netherlands, April,
2014.
- Jian Ouyang,
Shiding Lin, Song Jiang, Zhenyu Hou, Yong Wang, and Yuanzheng Wang,
"SDF: Software-Defined Flash
for Web-Scale Internet Storage System", in Proceedings of the
Nineteenth International Conference on Architectural Support for
Programming Languages and Operating Systems (ASPLOS'14), Salt Lake
City, UT, March, 2014.
- Jie Fan, Song
Jiang, Jiwu Shu, Youhui Zhang, and Weimin Zhen, "Aegis: Partitioning Data Block for
Efficient Recovery of Stuck-At-Faults in Phase Change Memory", in
Proceedings of the 46th Annual IEEE/ACM International Symposium on
Microarchitecture (MICRO'13), Davis, CA, December, 2013.
- Berk
Atikoglu, Yuehai Xu, Eitan Frachtenberg, Song Jiang, and Mike
Paleczny, "Workload
Analysis of a Large-Scale Key-Value Store", in Proceedings of the
ACM SIGMETRICS/Performance 2012 Conference (SIGMETRICS'12), London,
United Kingdom, June, 2012.
- Xuechen
Zhang, Yuehai Xu, and Song Jiang, "YouChoose: A Performance
Interface Enabling Convenient and Efficient QoS Support for Consolidated
Storage Systems", in Proceedings of 27th IEEE Symposium on
Massive Storage Systems and Technologies (MSST'11), Denver, CO,
May, 2011. [Best Student Paper
Award]
- Yuehai Xu
and Song Jiang, "A
Scheduling Framework that Makes any Disk Schedulers Non-work-conserving
solely based on Request Characteristics", in Proceedings of the
9th USENIX Conference on File and Storage Technologies (FAST'11),
San Jose, CA, February, 2011.
- Xuechen
Zhang and Song Jiang, "InterferenceRemoval:
Removing Interference of Disk Access for MPI Programs through Data
Replication", in Proceedings of 24th ACM International Conference
on Supercomputing (ICS'10), Tsukuba, Japan, June, 2010.
- Xiaoning
Ding, Song Jiang, and Xiaodong Zhang, "A Framework Making Any
Replacement Algorithms (Almost) Lock Contention Free", in
Proceedings of the 25th IEEE Int'l Conference on Data Engineering (ICDE'09),
Shanghai, China, March, 2009.
- Xiaoning
Ding, Song Jiang, Feng Chen, Kei Davis, and Xiaodong Zhang, "DiskSeen: Exploiting Disk Layout
and Access History to Enhance I/O Prefetch", in Proceedings of
2007 USENIX Annual Technical Conference (USENIX'07), Santa Clara,
CA, June, 2007.
- Song Jiang,
Xiaoning Ding, Feng Chen, Enhua Tan, and Xiaodong Zhang, "DULO: an Effective Buffer Cache
Management Scheme to Exploit both Temporal and Spatial Locality",
in Proceedings of the 4th USENIX Conference on File and Storage Technologies
(FAST'05), San Francisco, CA, December, 2005.
- Song Jiang,
Feng Chen, and Xiaodong Zhang, "CLOCK-Pro: An Effective
Improvement of the CLOCK Replacement", in Proceedings of 2005
USENIX Annual Technical Conference (USENIX'05), Anaheim, CA, April,
2005.
- Song Jiang
and Xiaodong Zhang, "LIRS: An
Efficient Low Inter-Reference Recency Set Replacement Policy to Improve
Buffer Cache Performance", in Proceedings of the ACM SIGMETRICS
Conference on Measurement and Modeling of Computer Systems (SIGMETRICS'02),
Marina Del Rey, CA, June, 2002.
Book
Impacts of Dr. Song Jiang's
Representative Work
·
LIRS is part of MySQL, the world's most popular
open source database, which are used by Google, Yahoo!, YouTube, and Wikipedia.
(see the LIRS-related code in
MySQL)
o The LIRS replacement algorithm has been officially incorporated
in the MySQL's most recent release (version 5.1), announced in November, 2008.
o Read the Documentation
on the Implementation.
o This Sigmetrics paper
and its journal paper describe the
algorithm.
o Towards
an O(1) VM, by Rik van Riel. (A Linux architect's view of LIRS in virtual
memory)
o The LIRS caching algorithm has been adopted in Infinispan data grid platform.
o A JBoss Community article
about LIRS in Infinispan.
·
Clock-pro has been officially adopted in NetBSD. (See Clock-pro
related code in NetBSD)
o "CLOCK-Pro: An
Effective Improvement of the CLOCK Replacement", in Proceedings of
2005 USENIX Annual Technical Conference (USENIX'05), Anaheim, CA, April,
2005, pp. 323-336.
o A report about its
Linux kernel implementation.
o The Linux
Kernel Development Page of Clock-Pro.
o Clock-Pro
patches in 2.6.12 developed by Rik van Riel from the Red Hat, Inc..
o An alternative
Linux kernel implementation of Clock-Pro by Peter Zijlstra.
o A Clock-pro patch is available for OpenLDAP (an open source implementation of
the Lightweight Directory Access Protocol), implemented by Howard Chu, Symas
Corp. (source code).
o Advanced
Page Replacement Projects established by the Linux Memory Management
Development Team.
o Clock-pro is referred as an example in the section of Linux and
Academia in Book Professional
Linux Kernel Architecture by Wolfgan Mauerer.
·
The swap token
algorithm has been officially adopted in Linux Kernel version 2.6. (See swap-token related code in
Linux)
o "Token-ordered LRU:
an effective page replacement policy and its implementation in Linux systems",
Performance Evaluation, Vol. 60, Issue 1-4, 2005, pp. 5-29.
o Discussion on the
implementation of the algorithm on lwn.net.
o The token algorithm is described
in the widely used Linux kernel book Understanding the Linux Kernel
(3rd edition) by D. P. Bovet and M. Casati.
o A detailed description of the token algorithm in Linux is
presented in a section
in book Professional
Linux Kernel Architecture by Wolfgan Mauerer.
o The token algorithm is referred as an example in the section of Linux and
Academia in Book Professional
Linux Kernel Architecture by Wolfgan Mauerer.
o The concept of "false
LRU pages" causing system thrashing is quoted in the Linux Memory
Management Wiki.
Students
·
Xingsheng Zhao (Ph.D.
Student)
·
Chen Zhong (Ph.D.
Student)
Alumni
·
Fan Ni (Ph.D., 2019, VMWare)
·
Xingbo
Wu (Ph.D., 2018, Assistant Professor, University
of Illinois at Chicago)
·
Yuehai Xu (Ph.D., 2014,
VMWare)
·
Xuechen Zhang
(Ph.D., 2012, Associate Professor at Washington State University Vancouver)
·
Jianqiang Ou (ME,
2015, PureStoragee)
·
Ke Liu (ME, 2013,
Qualcomm)
·
Yizhe Wang (ME, 2011,
VMWare)
Research Support
·
NSF, “Using Software
Defined Cache to Accelerate Index Search for In-memory Applications: Software
and Hardware Approaches”, 2018-2021, (sole PI).
·
NSF, "Taming Small Data Writes to Block Storage
Devices for Higher I/O Efficiency", 2015-2018 (sole PI).
·
NSF, "Adaptively Applying Data-driven Execution
Mode to Remove I/O Bottleneck for Data-intensive Computing", 2012-2016
(sole PI).
·
NSF, "Enabling
Dual-mode Execution for Removing I/O Bottleneck: A Highly Applicable Design and
Implementation", 2011-2016 (sole PI).
·
NSF, "CAREER:
Building Virtual Devices with QoS Assurance in a Consolidated Storage
Infrastructure", 2009-2014 (sole PI).
·
NSF,
"Collaborative Research: Algorithms Design and Systems Implementation to
Improve Buffer Management for I/O Data Accesses", 2007-2010, (sole PI).
·
NSF, "CRI:
Reconfigurable High Performance Cluster Computing and Medical Engineering
Applications", 2007-2009 (co-PI).
·
Twelve servers (Intel
8-core Xeon and 64GB DRAM) donated by Facebook.