2020年度历次组会列表

2020上半年组会信息


日期 汇报人 报告题目 备注 链接
1月4日 周仁杰 Multi-Tenant Multi-Objective Bandwidth Allocation in Datacenters Using Stacked Congestion Control INFOCOM 2017 paper
slides
1月11日 严锦立 Injection Time Planning: Making CQF Practical in Time-Sensitive Networking INFOCOM 2020 paper
slides
2月29日 汪杨海 More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server NIPS 2013 paper
slides
Exploiting Bounded Staleness to Speed Up Big Data Analytics ATC 2014 paper
slides
3月7日 王长宏 Combining Source-adaptive and Oblivious Routing with Congestion Control in High-performance Interconnects using Hybrid and Direct Topologies TACO 2019 paper
slides
Straightforward solutions to reduce HoL blocking in different Dragonfly fully-connected interconnection patterns TJSC 2016 paper
slides
3月14日 金康 Efficient Memory Disaggregation with INFINISWAP NSDI 2017 paper
slides
周仁杰 Eiffel: Efficient and Flexible Software Packet Scheduling NSDI 2019 paper
slides
3月21日 欧阳硕 BML: A High-performance, Low-cost Gradient Synchronization Algorithm for DML Training NIPS 2018 paper
slides
袁郭苑 Minimal Rewiring: Efficient Live Expansion for Clos Data Center Networks NSDI 2019 paper
slides
3月28日 马俊超 Generalisation of Recursive Doubling for AllReduce: Now with Simulation PARCO 2017 paper
slides
4月4日 黄山 Re-architecting Congestion Management in Lossless Ethernet NSDI 2020 paper
王育洋 FatPaths: Routing in Supercomputers, Data Centers, and Clouds with Low-Diameter Networks when Shortest Paths Fall Short CoRR 2019 paper
4月11日 徐叶茂 Massively Parallel Hyperparameter Tuning SysML 2020 paper
胡鼎煌 SocksDirect: Datacenters Sockets can be Fast and Compatible SIGCOMM 2019 paper
slides
4月18日 吴克 Measuring Congestion in High-Performance Datacenter Interconnects NSDI 2020 paper
slides
王笑雨 iRDMA: Efficient Use of RDMA in Distributed Deep Learning System HPCC 2017 paper
4月25日 杨维铃 LIBXSMM: Accelerating Small Matrix Multiplications by Runtime Code Generation SC 2016 paper
5月9日 周煜琨 ECN+: A marking-aware optimization for ECN threshold via per-Port in Data Center Networks JNCA 2019 paper
5月25日 王长宏 Understanding congestion in high performance interconnection networks using sampling SC 2019 paper
汪杨海 Horovod: fast and easy distributed deep learning in TensorFlow CoRR 2018 paper
5月30日 金康 A Complete Key Recovery Timing Attack on a GPU HPCA 2016 paper
白洋 Aeolus: A Building Block for Proactive Transport in Datacenters SIGCOMM 2020 paper
6月6日 欧阳硕 AdaComp: Adaptive Residual Gradient Compression for Data-Parallel Distributed Training AAAI 2018 paper
周仁杰 Support ECN in Multi-Queue Datacenter Networksvia per-Port Marking with Selective Blindness ICDCS 2018 paper
6月14日 马骏超 Scaling Distributed Machine Learning with In-Network Aggregation Computer Science 2019 paper
袁郭苑 Expanding across time to deliver bandwidth efficiency and low latency NSDI 2020 paper
6月21日 徐叶茂 EFLOPS: Algorithm and System Co-design for a High Performance Distributed Training Platform HPCA 2020 paper
王育洋 Mitigating Network Noise on Dragonfly Networks through Application-Aware Routing SC 2019 paper
7月5日 谢徐超 TCP ≈ RDMA: CPU-efficient Remote Storage Access with i10 NSDI 2020 paper
7月12日 吴克 DRAIN: Deadlock Removal for Arbitrary Irregular Networks HPCA 2020 paper
胡鼎煌 A Distributed Algorithm to Calculate Max-Min Fair Rates Without Per-Flow State POMACS paper
7月19日 杨维铃 Optimizing N-dimensional, winograd-based convolution for manycore CPUs PPoPP 2018 paper
王笑雨 Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models NeurIPS 2019 paper
7月26日 周煜琨 Dart: Divide and Specialize for Fast Response toCongestion in RDMA-based Datacenter Networks TON 2020 paper
8月16日 白洋 Programmable Calendar Queues for High-speed Packet Scheduling NSDI 2020 paper
王长宏 Global link arrangement for practical Dragonfly ICS 2020 paper
8月23日 金康 Exploiting Bank Conflict-based Side-channel Timing Leakage of GPUs TACO paper
乔星涵 Performance Characterization of NVMe-over-Fabrics Storage Disaggregation TOS paper
8月30日 欧阳硕 DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-pass Error-Compensated Compression ICML 2019 paper
9月6日 汪杨海 Optimized Broadcast for Deep LearningWorkloads on Dense-GPU InfiniBand Clusters: MPI or NCCL? EuroMPI 2018 paper
9月13日 马骏超 A Scalable, High-Performance, and Fault-Tolerant Network Architecture for Distributed Machine Learning TON 2020 paper
9月20日 王育洋 Isolated Trees in Multi-Tenant Fat Tree Datacenters for In-Network Computing HotI 2020 paper
9月27日 胡鼎煌 Annulus: A Dual Congestion Control Loop for Datacenter and WAN Traffic Aggregates SIGCOMM 2020 paper
10月11日 吴克 PCF: Provably Resilient Flexible Routing SIGCOMM 2020 paper
周仁杰 Swift: Delay is Simple and Effective for Congestion Control in the Datacenter SIGCOMM 2020 paper
10月18日 王笑雨 Stay Fresh: Speculative Synchronization for Fast Distributed Machine Learning ICDCS 18 paper
10月25日 王长宏 An In-Depth Analysis of the Slingshot Interconnect SC 2020 paper
欧阳硕 Elastic Parameter Server Load Distribution in Deep Learning Clusters SoCC 2020
11月1日 黄山 Efficient Dynamic Isolation of Congestion in Lossless DataCenter Networks SIGCOMM 2019 paper
周泽嘉 One More Config is Enough: Saving (DC)TCP for High-speed Extremely Shallow-buffered Datacenters INFOCOM 2020 paper
11月8日 张晓云 Automated Performance Modeling of HPC Applications Using Machine Learning TC 2020 paper
马骏超 RAT - Resilient Allreduce Tree for Distributed Machine Learning APNet 2020 paper
于恩达 Communication-Efficient Distributed Deep Learning with Merged Gradient Sparsification on GPUs INFOCOM 2020 paper
11月15日 白洋 P4air: Increasing Fairness among Competing Congestion Control Algorithms ICNP 2020 paper
杨维铃 A Coordinated Tiling and Batching Framework for Efficient GEMM on GPUs PPoPP 2019 paper
司嘉奇 Gluon: A Communication-Optimizing Substrate for Distributed Heterogeneous Graph Analytics PLDI 2018 paper
11月22日 胡鼎煌 1RMA: Re-envisioning Remote Memory Access for Multi-tenant Datacenters SIGCOMM 2020 paper
王育洋 Architecture and performance studies of 3D-Hyper-FleX-LION for reconfigurable all-to-all HPC networks SC 2020 paper
黄泽彪 Taming Unbalanced Training Workloads in Deep Learning with Partial Collective Operations PPoPP 2020 paper
11月29日 吴克 TAGO: Rethinking Routing Design in High Performance Reconfigurable Networks SC 2020 paper
周煜琨 Polo: Receiver-Driven Congestion Control for Low Latency over Commodity Network Fabric ICPP 2020 paper
12月6日 王笑雨 Preemptive All-reduce Scheduling for Expediting Distributed DNN Training INFOCOM 2020 paper
袁郭苑 PINT: Probabilistic In-band Network Telemetry SIGCOMM 2020 paper
顾文豪 Shoal: A Network Architecture for Disaggregated Racks NSDI 2019 paper
12月13日 周泽嘉 RoCC: robust congestion control for RDMA CoNEXT 2020 paper
周仁杰 TINA: A Fair Inter-datacenter Transmission Mechanism with Deadline Guarantee INFOCOM 2020 paper
12月20日 王长宏 RLScheduler: An Automated HPC Batch Job Scheduler Using Reinforcement Learning SC 2020 paper
张宗茂 The gem5 Simulator SIGARCH paper
12月27日 张晓云 ANN Based Admission Control for On-Chip Networks DAC 2019 paper
王绍聪 Topology-custom UGAL routing on dragonfly SC 2019 paper