This portal provides links to various research papers on simultaneous multithreading (SMT). A more extensive (yet outdated) paper list can be found here:
   * [[http://tbp.berkeley.edu/~jdonald/research/hyperthreading][Simultaneous Multithreading resources @ UC Berkeley]]

---+++ General on Simultaneous Multithreading
   * _"Simultaneous Multithreading: A Platform for Next-Generation Processors"_. S. Eggers, J. Emer, H. Levy, J. Lo, R. Stamm, D. Tullsen. IEEE Micro, vol. 17, no. 5, *1997*. ([[ftp://ftp.cs.washington.edu/tr/1997/04/UW-CSE-97-04-02.PS.Z][ps]])
   * _"ILP versus TLP on SMT"_. N. Mitchell, L. Carter, J. Ferrante, D. Tullsen. Proceedings of the ACM/IEEE conference on Supercomputing, *1999*. ([[http://www-cse.ucsd.edu/users/tullsen/ilpvtlp.ps][ps]])

---+++ SMT Implementations
   * _"Hyper-Threading Technology Architecture and Microarchitecture"_. D. Marr, F. Binns, D. Hill, G. Hinton, D. Koufaty, J. Miller, M. Upton. Intel Technology Journal, vol. 6, issue 1, *2002*. ([[http://download.intel.com/technology/itj/2002/volume06issue01/art01_hyper/vol6iss1_art01.pdf][pdf]])
   * _"Initial Observations of the Simultaneous Multithreading Pentium 4 Processor"_. N. Tuck, D. Tullsen. Proceedings of the 12th International Conference on Parallel Architectures and Compilation Techniques, *2003*. ([[http://www-cse.ucsd.edu/users/tullsen/pact03.pdf][pdf]])
   * _"IBM Power5 Chip: A Dual-Core Multithreaded Processor"_. R. Kalla, B. Sinharoy, J. Tendler. IEEE Micro, vol. 24, no. 2, *2004*. ([[http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=1289290&isnumber=28738][pdf]])([[http://][cslab]])

---+++ Resource Sharing in SMTs
   * _"Exploiting choice: instruction fetch and issue on an implementable simultaneous multithreading processor"_. D. Tullsen, S. Eggers, J. Emer, H. Levy, J. Lo, R. Stamm. Proceedings of the 23rd annual international symposium on Computer architecture, *1996*. ([[http://www-cse.ucsd.edu/users/tullsen/ISCA96.ps][ps]])
   * _"Handling long-latency loads in a simultaneous multithreading processor"_. D. Tullsen, J. Brown. Proceedings of the 34th annual ACM/IEEE international symposium on Microarchitecture, *2001*. ([[http://www-cse.ucsd.edu/users/tullsen/smtload.pdf][pdf]])
   * _"Front-End Policies for Improved Issue Efficiency in SMT Processors"_. A. El-Moursy, D. Albonesi. Proceedings of the 9th International Symposium on High-Performance Computer Architecture, *2003*. ([[http://www.ece.rochester.edu/~albonesi/research/papers/hpca03_1.pdf][pdf]])
   * _"Dynamically Controlled Resource Allocation in SMT Processors"_. F. Cazorla, A. Ramirez, M. Valero, E. Fernandez. Proceedings of the 37th annual ACM/IEEE international symposium on Microarchitecture, *2004*. ([[http://personals.ac.upc.edu/fcazorla/articles/fcazorla_dcra2004.pdf][pdf]])
   * _"Learning-Based SMT Processor Resource Distribution via Hill-Climbing"_. S. Choi, D. Yeung. Proceedings of the 33rd annual international symposium on Computer Architecture, *2006*. ([[http://maggini.eng.umd.edu/pub/2006-ISCA-Hill.pdf][pdf]])
   * _"Software-Controlled Priority Characterization of POWER5 Processor"_. C. Boneti, F. Cazorla, R. Gioiosa, A. Buyuktosunoglu, C. Cher, M. Valero. SIGARCH Comput. Archit. News, ACM, 36, 415-426, *2008*. ([[http://personals.ac.upc.edu/fcazorla/articles/cboneti_isca_2008.pdf][pdf]])

---+++ Helper Threading
   * _"Simultaneous subordinate microthreading (SSMT)"_. R. Chappell, J. Stark, S. Kim, S. Reinhardt, Y. Patt. Proceedings of the 26th annual international symposium on Computer architecture, *1999*. ([[http://www.eecs.umich.edu/~stever/pubs/isca99.pdf][pdf]])
   * _"Tolerating Latency through Software-Controlled Pre-Execution in Simultaneous Multithreading Processors"_. C. Luk. Proceedings of the 28th Annual International Symposium on Computer Architecture, *2001*. ([[http://www.cs.cmu.edu/~luk/luk_papers/isca01.ps.gz][ps]])
   * _"Speculative precomputation: long-range prefetching of delinquent loads"_. J. Collins, H. Wang, D. Tullsen, C. Hughes, Y. Lee, D. Lavery, J. Shen. Proceedings of the 28th annual international symposium on Computer architecture, *2001*. ([[http://www-cse.ucsd.edu/users/tullsen/isca2001.pdf][pdf]])
   * _"Speculative Precomputation: Exploring the Use of Multithreading for Latency"_. H. Wang, P. Wang, R. Weldon, S. Ettinger, H. Saito, M. Girkar, S. Liao, J. Shen. Intel Technology Journal, vol. 6, issue 1, *2002*. ([[http://download.intel.com/technology/itj/2002/volume06issue01/art03_specprecomp/vol6iss1_art03.pdf][pdf]])
   * _"Transparent Threads: Resource Sharing in SMT Processors for High Single-Thread Performance"_. G. Dorai, D. Yeung. Proceedings of the 11th international conference on Parallel architectures and compilation techniques, *2002*. ([[http://maggini.eng.umd.edu/pub/dorai212.pdf][pdf]])
   * _"Physical Experimentation with Prefetching Helper Threads on Intel's Hyper-Threaded Processors"_. D. Kim, J. Shen, S. Liao, P. Wang, J. Cuvillo, X. Tian, X. Zou, H. Wang, D. Yeung, M. Girkar. Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization, *2004*. ([[http://maggini.eng.umd.edu/pub/pre-exec-cgo2004.pdf][pdf]])

---+++ Synchronization
   * _"Supporting Fine-Grained Synchronization on a Simultaneous Multithreading Processor"_. D. Tullsen, J. Lo, S. Eggers, H. Levy. Proceedings of the IEEE 5th International Symposium on High Performance Computer Architecture, *1999*. ([[http://www-cse.ucsd.edu/users/tullsen/HPCA5synch.ps][ps]])
   * _"Exploiting Fine-Grained Data Parallelism with Chip Multiprocessors and Fast Barriers"_. J. Sampson, R. Gonzalez, J. Collard, N. Jouppi, M. Schlansker, B. Calder. Proceedings of the International Symposium on Microarchitecture, *2006*. ([[http://www.cse.ucsd.edu/~calder/papers/MICRO-06-Barriers.pdf][pdf]])

---+++ Job Scheduling
   * _"Symbiotic job scheduling with Priorities for a Simultaneous Multithreading Processor"_. A. Snavely, D. Tullsen, G. Voelker. Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, *2002*. ([[http://www-cse.ucsd.edu/users/tullsen/sigmet02.pdf][pdf]])
   * _"Architectural Support for Enhanced SMT Job Scheduling"_. A. Settle, J. Kihm, A. Janiszewski, D. Connors. Proceedings of the 13th international conference on Parallel architectures and compilation techniques, *2004*. ([[http://rogue.colorado.edu/draco/papers/pact04-smt.pdf][pdf]])
   * _"Scheduling Algorithms for Effective Thread Pairing on Hybrid Multiprocessors"_. R. McGregor, C. Antonopoulos, D. Nikolopoulos. Proceedings of the 19th International Symposium on Parallel and Distributed Processing, *2005*. ([[http://people.cs.vt.edu/~dsn/papers/IPDPS_2005.pdf][pdf]])
   * _"Hyper-threading aware process scheduling heuristics"_. J. Bulpin, I. Pratt. Proceedings of the USENIX Annual Technical Conference, *2005*. ([[http://www.cl.cam.ac.uk/~jrb44/docs/bulpin_usenix05.pdf][pdf]])
   * _"Dynamic run-time architecture techniques for enabling continuous optimization"_. T. Moseley, A. Shye, V. Reddi, M. Iyer, D. Fay, D. Hodgdon, J. Kihm, A. Settle, D. Grunwald, D. Connors. Proceedings of the 2nd conference on Computing frontiers, *2005*. ([[http://rogue.colorado.edu/draco/papers/cf05-draco.pdf][pdf]])
   * _"Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors"_. D. Tam, R. Azimi, M. Stumm. Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems, *2007*. ([[http://www.eecg.utoronto.ca/~tamda/papers/threadclustering.pdf][pdf]])

---+++ Code Optimizations and Runtime Techniques
   * _"Tuning compiler optimizations for simultaneous multithreading"_. J. Lo, S. Eggers, H. Levy, S. Parekh, D. Tullsen. Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, *1997*. ([[http://www-cse.ucsd.edu/users/tullsen/smtcompiler.ps][ps]])
   * _"Maximizing TLP with loop-parallelization on SMT"_. D. Puppin, D. Tullsen. 5th Workshop on Multithreaded Execution, Architecture, and Compilation, *2001*. ([[http://cag.lcs.mit.edu/commit/papers/01/diego_SMT_MTEAC.ps][ps]])
   * _"Code and Data Transformations for Improving Shared Cache Performance on SMT Processors"_. D. Nikolopoulos. ISHPC, pp. 54-69, *2003*. ([[http://people.cs.vt.edu/~dsn/papers/ISHPC_2003_corrected.pdf][pdf]])
   * _"Runtime support for integrating precomputation and thread-level parallelism on simultaneous multithreaded processors"_. M. Curtis-Maury, T. Wang, C. Antonopoulos, D. Nikolopoulos. Proceedings of the 7th workshop on languages, compilers, and run-time support for scalable systems, *2004*. ([[http://people.cs.vt.edu/~dsn/papers/LCR_2004.pdf][pdf]])
   * _"Runtime Empirical Selection of Loop Schedulers on Hyperthreaded SMPs"_. Y. Zhang, M. Voss. Proceedings of the 19th International Symposium on Parallel and Distributed Processing, *2005*. ([[http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=1419864&isnumber=30685][pdf]])([[http://][cslab]])
   * _"Integrating Multiple Forms of Multithreaded Execution on multi-SMT Systems: A Study with Scientific Applications"_. M. Curtis-Maury, T. Wang, C. Antonopoulos, D. Nikolopoulos. Proceedings of the Second International Conference on the Quantitative Evaluation of Systems, *2005*. ([[http://people.cs.vt.edu/~dsn/papers/QEST_2005.pdf][pdf]])
   * _"Dynamic tiling for effective use of shared caches on multithreaded processors"_. D. Nikolopoulos. International Journal of High Performance Computing and Networking, Vol. 2, pp. 22-35, *2006*. ([[http://people.cs.vt.edu/~dsn/papers/IJHPCN_2004.pdf][pdf]])

---+++ Applications
   * _"Database hash-join algorithms on multithreaded computer architectures"_. P. Garcia, H. Korth. Proceedings of the 3rd conference on Computing frontiers, *2006*. ([[http://portal.acm.org/ft_gateway.cfm?id=1128055&type=pdf&coll=GUIDE&dl=GUIDE&CFID=5375546&CFTOKEN=25291073][pdf]])([[http://][cslab]])
   * _"Multigrain parallel Delaunay Mesh generation: challenges and opportunities for multithreaded architectures"_. C. Antonopoulos, X. Ding, A. Chernikov, F. Blagojevic, D. Nikolopoulos, N. Chrisochoides. Proceedings of the 19th annual international conference on Supercomputing, *2005*. ([[http://people.cs.vt.edu/~dsn/papers/ICS_2005.pdf][pdf]])
   * _"Stream Programming on General-Purpose Processors"_. J. Gummaraju, M. Rosenblum. Proceedings of the 38th annual IEEE/ACM International Symposium on Microarchitecture, *2005*. ([[http://merrimac.stanford.edu/publications/micro38_streamingGPP.pdf][pdf]])

---+++ SMT Extensions and Variations
   * _"Mini-Threads: Increasing TLP on Small-Scale SMT Processors"_. J. Redstone, S. Eggers, H. Levy. Proceedings of the IEEE 9th International Symposium on High Performance Computer Architecture, *2003*. ([[http://www.cs.washington.edu/research/smt/papers/minithreads.pdf][pdf]])
   * _"Balanced Multithreading: Increasing Throughput via a Low Cost Multithreading Hierarchy"_. E. Tune, R. Kumar, D. Tullsen, B. Calder. Proceedings of the 37th annual IEEE/ACM International Symposium on Microarchitecture, *2004*. ([[http://www-cse.ucsd.edu/users/tullsen/balancedmt.pdf][pdf]])
Topic revision: r4 - 2008-10-08 - NikosAnastopoulos