Steve Lumetta's Thesis Bibliography
(BibTeX source file)
-
N. S. Arora, R. D. Blumofe, and C. G. Plaxton, "Thread Scheduling for
Multiprogrammed Multiprocessors," In Proceedings of the 10th Symposium on
Parallel Algorithms and Architectures, pp. 119-29, Puerto Vallarta,
Mexico, June 1998.
-
T. E. Anderson, D. E. Culler, D. A. Patterson, and the NOW Team, "A Case for
NOW (Networks of Workstations)," IEEE Micro, 15(1):54-64, February
1995.
-
B. S. Ang, D. Chiou, D. L. Rosenband, M. Ehrlich, L. Rudolph, and Arvind,
"StarT-Voyager: A Flexible Platform for Exploring Scalable SMP Issues," In
Proceedings of SC98: High Performance Networking and Computing,
Orlando, Florida, November 1998.
-
A. C. Arpaci-Dusseau, Implicit Coscheduling: Coordinated Scheduling with
Implicit Information in Distributed Systems, PhD thesis, University of
California at Berkeley, December 1998.
-
A. C. Arpaci-Dusseau, D. E. Culler, and A. M. Mainwaring, "Scheduling with
Implicit Information in Distributed Systems," In Proceedings of the
SIGMETRICS'98/PERFORMANCE'98 Joint International Conference on Measurement
and Modeling of Computer Systems, pp. 233-43, Madison, Wisconsin, June
1998.
-
R. H. Arpaci, A. C. Dusseau, A. H. Vahday, L. T. Liu, T. E. Anderson, and D. A.
Patterson, "The Interaction of Parallel and Sequential Workloads on a Network
of Workstations," In Proceedings of SIGMETRICS Joint International
Conference on Measurement and Modeling of Computer Systems, pp. 267-78,
Ottawa, Canada, May 1995.
-
A. Alexandrov, M. Ionescu, K. E. Schauser, and C. Scheiman, "LogGP:
Incorporating Long Messages into the LogP Model--One Step Closer towards a
Realistic Model for Parallel Computation," In Proceedings of the 7th
Symposium on Parallel Algorithms and Architectures, pp. 95-105, Santa
Barbara, California, July 1995.
-
T. E. Anderson, "The Performance of a Spin Lock Alternative for Shared-Memory
Multiprocessors," IEEE Transactions on Parallel and Distributed
Systems, 1(1):6-16, January 1990.
-
E. Anderson and D. A. Patterson, "Extensible, Scalable Monitoring for Clusters
of Computers," In Proceedings of the 11th Systems Administration
Conference (LISA'97), pp. 9-16, San Diego, California, October 1997.
-
T. Brewer and G. Astfalk, "The Evolution of the HP/Convex Exemplar," In
Proceedings of COMPCON Spring '97, pp. 81-6, San Jose, California,
February 1997.
-
N. J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. L. Seitz, J. N.
Seizovic, and W. Su, "Myrinet--A Gigabit-per-Second Local-Area Network,"
IEEE Micro, 15(1):29-38, February 1995.
-
E. A. Brewer, F. T. Chong, L. T. Liu, S. D. Sharma, and J. D. Kubiatowicz,
"Remote Queues: Exposing Message Queues for Optimization and Atomicity," In
Proceedings of the 7th Symposium on Parallel Algorithms and
Architectures, pp. 42-53, Santa Barbara, California, July 1995.
-
M. A. Blumrich, C. Dubnicki, E. W. Felten, K. Li, and M. R. Mesarina,
"Virtual-Memory-Mapped Network Interfaces," IEEE Micro,
15(1):21-28, February 1995.
-
D. A. Bader and J. JáJá, "SIMPLE: A Methodology for Programming High
Performance Algorithms on Clusters of Symmetric Multiprocessors (SMP's)," May
1997.
(preliminary version)
-
E. A. Brewer and B. C. Kuszmaul, "How to Get Good Performance from the CM-5
Data Network," In Proceedings of the 8th International Parallel
Processing Symposium, pp. 858-67, Cancun, Mexico, April 1994.
-
R. Butler and E. Lusk, "Monitors, Message, and Clusters: the p4 Parallel
Programming System," Parallel Computing, 20(4):547-64, April 1994.
-
M. A. Blumrich, K. Li, R. Alpert, C. Dubnicki, E. W. Felten, and J. Sandberg,
"Virtual-Memory-Mapped Network Interface for the SHRIMP Multicomputer," In
Proceedings of the 21st International Symposium on Computer
Architecture, pp. 142-53, Chicago, Illinois, April 1994.
-
G. Blelloch, C. Leiserson, and B. Maggs, "A Comparison of Sorting Algorithms
for the Connection Machine CM-2," In Proceedings of the 3rd Symposium on
Parallel Algorithms and Architectures, pp. 3-16, Hilton Head, South
Carolina, July 1991.
-
R. D. Blumofe and D. Papadopoulos, "The Performance of Work Stealing in
Multiprogrammed Environments," University of Texas at Austin Technical Report
CS-TR-98-13, May 1998.
-
G. T. Byrd, "Models of Communication Latency in Shared Memory Multiprocessors,"
Stanford University Technical Report CSL-TR-93-596, December 1993.
-
K. Connelly and A. A. Chien, "FM-QoS: Real-time Communication using
Self-synchronizing Schedules," In Proceedings of SC97: High Performance
Networking and Computing, San Jose, California, November 1997.
-
D. E. Culler, A. Dusseau, S. C. Goldstein, A. Krishnamurthy, S. S. Lumetta, T.
von Eicken, and K. Yelick, "Parallel Programming in Split-C," In
Proceedings of Supercomputing 1993, pp. 262-73, Portland, Oregon,
November 1993.
-
D. E. Culler, A. C. Dusseau, R. P. Martin, and K. E. Schauser, Fast
Parallel Sorting under LogP: from Theory to Practice, chapter 4, pp.
71-98.
John Wiley and Sons, 1993.
-
A. L. Cox and R. J. Fowler, "Adaptive Cache Coherency for Detecting Migratory
Shared Data," In Proceedings of the 20th International Symposium on
Computer Architecture, pp. 98-108, San Diego, California, May 1993.
-
D. R. Cheriton and R. A. Kutter, "Optimized Memory-Based Messaging: Leveraging
the Memory System for High-Performance Communication," Computing
Systems, 9(3):179-215, 1996.
-
D. E. Culler, R. M. Karp, D. A. Patterson, A. Sahay, K. E. Schauser, E. Santos,
R. Subramonian, and T. von Eicken, "LogP: Towards a Realistic Model of
Parallel Computation," In Proceedings of the 4th Symposium on Principles
and Practice of Parallel Programming, San Diego, California, May 1993.
-
D. E. Culler, L. T. Liu, R. P. Martin, and C. O. Yoshikawa, "Assessing Fast
Network Interfaces," IEEE Micro, 16(1):35-43, February 1996.
-
B. N. Chun, A. M. Mainwaring, and D. E. Culler, "Virtual Network Transport
Protocols for Myrinet," IEEE Micro, 18(1):53-63, January 1998.
-
Sun Microsystems Computer Corporation, "UltraSPARC-I Data Sheet: First
Generation SPARC v9 64-Bit Microprocessor with VIS," July 1997.
Document #STP1030A.
(available
online)
-
International Business Machines Corporation, "S/390 Parallel Enterprise Server
and OS/390 Reference Guide," May 1998.
-
A. Charlesworth, A. Phelps, R. Williams, and G. Gilbert, "Gigaplane-XB:
Extending the Ultra Enterprise Family," In Proceedings of Hot
Interconnects V, pp. 97-112, Stanford, California, August 1997.
-
J. Chapin, M. Rosenblum, S. Devine, T. Lahiri, D. Teodosiu, and A. Gupta,
"Hive: Fault Containment for Shared-Memory Multiprocessors," In
Proceedings of the 15th Symposium on Operating Systems Principles,
pp. 12-25, Copper Mountain Resort, Colorado, December 1995.
-
D. E. Culler, J. P. Singh, and with A. Gupta, Parallel Computer
Architecture: A Hardware/Software Approach. Morgan Kaufmann, 1998.
-
F. Douglis and J. K. Ousterhout, "Process Migration in the Sprite Operating
System," In Proceedings of the 7th International Conference on
Distributed Computing Systems, pp. 18-25, Berlin, West Germany,
September 1987.
-
"Accelerated Strategic Computing Initiative, a program of the U. S. Department
of Energy."
(information available
online)
-
S. J. Fink and S. B. Baden, "Partitioning of Finite Difference Methods Running
on SMP Clusters."
(available online)
-
S. J. Fink and S. B. Baden, "Runtime Support for Multi-Tier Programming of
Block-Structured Applications on SMP Clusters," In Y.~Ishikawa~et al.,
editor, Scientific Computing in Object-Oriented Parallel
Environments, Vol. 1343 of Lecture Notes in Computer Science,
pp. 1-8, Springer-Verlag, 1997.
(available online)
-
I. Foster, J. Geisler, C. Kesselman, and S. Tuecke, "Managing Multiple
Communication Methods in High-Performance Networked Computing Systems," In
Journal of Parallel and Distributed Computing, Vol. 40, pp. 35-48,
January 1997.
-
I. Foster, C. Kesselman, and S. Tuecke, "The Nexus Approach to Integrating
Multithreading and Communication," Journal of Parallel and Distributed
Computing, 37:70-82, August 1996.
-
M. Frigo, C. E. Leiserson, and K. H. Randall, "The Implementation of the Cilk-5
Multithreaded Language," In Proceedings of Programming Language Design
and Implementation, pp. 212-23, Montreal, Quebec, Canada, June 1998.
-
B. Falsafi and D. A. Wood, "Scheduling Communication on an SMP Node Parallel
Machine," In Proceedings of the 3rd International Symposium on
High-Performance Computer Architecture, pp. 128-38, San Antonio, Texas,
February 1997.
-
M. Greenwald and D. Cheriton, "The Synergy between Non-Blocking Synchronization
and Operating System Structure," In Proceedings of the 2nd Symposium on
Operating Systems Design and Implementation, pp. 123-36, Seattle,
Washington, October 1996.
-
K. Ghosh and A. J. Christie, "Communication Across Fault-Containment Firewalls
on the SGI Origin," In Proceedings of the 4th International Symposium on
High-Performance Computer Architecture, pp. 277-87, Las Vegas, Nevada,
February 1998.
-
R. B. Gillett and R. Kaufmann, "Using the Memory Channel Network," IEEE
Micro, 17(1):19-25, February 1997.
-
W. W. Gropp and E. L. Lusk, "A Taxonomy of Programming Models for Symmetric
Multiprocessors and SMP clusters," In Proceedings of Programming Models
for Massively Parallel Computers 1995, pp. 2-7, Berlin, Germany,
October 1995.
-
K. Gharachorloo, D. Lenoski, J. Laudon, P. Gibbons, A. Gupta, and J. L.
Hennessy, "Memory Consistency and Event Ordering in Scalable Shared-Memory
Multiprocessors," In Proceedings of the 17th International Symposium on
Computer Architecture, pp. 15-26, Seattle, Washington, May 1990.
-
A. Gottlieb, B. D. Lubachevsky, and L. Rudolph, "Basic Techniques for the
Efficient Coordination of Very Large Numbers of Cooperating Sequential
Processors," ACM Transactions on Programming Languages and Systems,
5(2):164-89, April 1983.
-
S. C. Goldstein, Lazy Threads: Compiler and Runtime Structures for
Fine-Grained Parallel Programming, PhD thesis, University of California
at Berkeley, June 1997.
-
D. P. Ghormley, D. Petrou, S. H. Rodrigues, A. M. Vahdat, and T. E. Anderson,
"GLUnix: a Global Layer Unix for a Network of Workstations,"
Software--Practice and Experience, 28(9):929-61, July 1998.
-
M. Haines, D. Cronk, and P. Mehrotra, "On the Design of Chant: A Talking
Threads Package," In Proceedings of Supercomputing 1994, pp. 350-9,
Washington, D.C., November 1994.
-
M. P. Herlihy, "Impossibility and Universality Results for Wait-Free
Synchronization," In Proceedings of the 7th Symposium on Principles of
Distributed Computing, pp. 276-290, Toronto, Ontario, Canada, August
1988.
-
M. Herlihy, "A Methodology for Implementing Highly Concurrent Data Objects,"
ACM Transactions on Programming Languages and Systems,
15(5):745-70, November 1993.
-
J. Heinlein, K. Gharachorloo, S. Dresser, and A. Gupta, "Integration of Message
Passing and Shared Memory in the Stanford FLASH Multiprocessor," In
Proceedings of the 6th International Conference on Architectural Support
for Programming Languages and Operating Systems, pp. 38-50, San Jose,
California, November 1994.
-
P. J. Husbands and J. C. Hoe, "MPI-StarT: Delivering Network Performance to
Numerical Applications," In Proceedings of SC98: High Performance
Networking and Computing, Orlando, Florida, November 1998.
-
M. D. Hill, J. R. Larus, S. K. Reinhardt, and D. A. Wood, "Cooperative Shared
Memory: Software and Hardware for Scalable Multiprocessors," In
Proceedings of the 5th International Conference on Architectural Support
for Programming Languages and Operating Systems, pp. 262-73, Boston,
Massachusetts, October 1992.
-
M. P. Herlihy and J. M. Wing, "Linearizability: A Correctness Condition for
Concurrent Objects," ACM Transactions on Programming Languages and
Systems, 12(3):463-92, July 1990.
-
"IBM S/390: The Defining Standard of Enterprise Computing, Today and Tomorrow,
International Business Machines Corporation," Document #GF22-5043-00., May
1998.
(available
online)
-
L. Iftode, C. Dubnicki, E. W. Felten, and K. Li, "Improving Release-Consistent
Shared Virtual Memory Using Automatic Update," In Proceedings of the 2nd
International Symposium on High-Performance Computer Architecture, pp.
14-25, San Jose, California, February 1996.
-
G. Iannello, M. Lauria, and S. Mercolino, "LogP Performance Characterization of
Fast Messages atop Myrinet," In Proceedings of the Sixth Euromicro
Workshop on Parallel and Distributed Processing, pp. 395-401, Madrid,
Spain, January 1998.
-
L. Iftode, J. P. Singh, and K. Li, "Scope Consistency: A Bridge between Release
Consistency and Entry Consistency," In Proceedings of the 8th Symposium
on Parallel Algorithms and Architectures, pp. 277-87, Padua, Italy,
June 1996.
-
L. Iftode, J. P. Singh, and K. Li, "Understanding Application Performance on
Shared Virtual Memory Systems," In Proceedings of the 23rd International
Symposium on Computer Architecture, pp. 122-33, Philadelphia,
Pennsylvania, May 1996.
-
Morris A. Jette, "Performance Characteristics of Gang Scheduling in
Multiprogrammed Environments," In Proceedings of SC97: High Performance
Networking and Computing, San Jose, California, November 1997.
-
C. F. Joerg and D. S. Henry, "A Tightly Coupled Processor-Network Interface,"
In Proceedings of the 5th International Conference on Architectural
Support for Programming Languages and Operating Systems, pp. 111-22,
Boston, Massachusetts, October 1992.
-
D. Jiang, H. Shan, and J. P. Singh, "Application Restructuring and Performance
Portability on Shared Virtual Memory and Hardware-Coherent Multiprocessors,"
In Proceedings of the 6th Symposium on Principles and Practice of
Parallel Programming, pp. 217-29, Las Vegas, Nevada, June 1997.
-
V. Karamcheti and A. A. Chien, "A Comparison of Architectural Support for
Messaging in the TMC CM-5 and the Cray T3D," In Proceedings of the 22nd
International Symposium on Computer Architecture, pp. 298-307, Santa
Margherita Ligure, Italy, June 1995.
-
P. Keleher, A. L. Cox, and W. Zwaenepoel, "Lazy Release Consistency for
Software Distributed Shared Memory," In Proceedings of the 19th
International Symposium on Computer Architecture, pp. 13-21, Gold
Coast, Queensland, Australia, May 1992.
-
D. Kuck, E. Davidson, D. Lawrie, A. Sameh, C.-Q. Zhu, A. Veidenbaum, J.
Konicek, P. Yew, K. Gallivan, W. Jalby, H. Wijshoff, R. Bramley, U. M. Yang,
P. Emrath, D. Padua, R. Eigenmann, J. Hoeflinger, G. Jaxon, Z. Li, T. Murphy,
J. Andrews, and S. Turner, "The Cedar System and an Initial Performance
Study," In Proceedings of the 20th International Symposium on Computer
Architecture, pp. 213-23, San Diego, California, May 1993.
-
F. P. Kelly, Reversability and Stochastic Networks. John Wiley and
Sons Ltd., 1979.
-
L. Kontothanassis, G. Hunt, R. Stets, N. Hardavellas, M. Cierniak, S.
Parthasarathy, W. Meira, Jr., S. Dwarkadas, and M. Scott, "VM-Based Shared
Memory on Low-Latency, Remote-Memory-Access Networks," In Proceedings of
the 24th International Symposium on Computer Architecture, pp. 157-69,
Denver, Colorado, June 1997.
-
A. Krishnamurthy, S. S. Lumetta, D. E. Culler, and K. Yelick, "Connected
Components on Distributed Memory Machines," In Sandeep Bhatt, editor,
Parallel Algorithms: Third DIMACS Implementation Challenge, Vol. 30
of DIMACS Series in Discrete Mathematics and Theoretical Computer
Science, pp. 1-21, 1997.
-
C. P. Kruskal, L. Rudolph, and M. Snir, "Efficient Synchronization on
Multiprocessors with Shared Memory," In Proceedings of the 5th Symposium
on Principles of Distributed Computing, pp. 218-28, Calgary, Alberta,
Canada, August 1986.
-
J. D. Kubiatowicz, Integrated Shared-Memory and Message-Passing
Communication in the Alewife Multiprocessor, PhD thesis, Massachusetts
Institute of Technology, February 1998.
-
A. Krishnamurthy and K. Yelick, "Analyses and Optimizations for Shared Address
Space Programs," Journal of Parallel and Distributed Computing,
38(2):130-44, November 1996.
-
L. T. Liu and D. E. Culler, "Evaluation of the Intel Paragon on Active Message
Communication," In Proceedings of the Intel Supercomputer Users Group
Conference, June 1995.
(available via the NOW project)
-
T. Lovett and R. Clapp, "STiNG: A CC-NUMA Computer System for the Commercial
Marketplace," In Proceedings of the 23rd International Symposium on
Computer Architecture, pp. 308-17, Philadelphia, Pennsylvania, May
1996.
-
S. S. Lumetta and D. E. Culler, "Managing Concurrent Access for Shared Memory
Active Messages," In Proceedings of the International Parallel Processing
Symposium, pp. 272-8, Orlando, Florida, March 1998.
-
K. Li and P. Hudak, "Memory Coherence in Shared Virtual Memory Systems," In
Proceedings of the 5th Symposium on Principles of Distributed
Computing, pp. 229-39, Calgary, Alberta, Canada, August 1986.
-
B.-H. Lim, P. Heidelberger, P. Pattnaik, and M. Snir, "Message Proxies for
Efficient, Protected Communication on SMP Clusters," In Proceedings of
the 3rd International Symposium on High-Performance Computer
Architecture, pp. 116-27, San Antonio, Texas, February 1997.
-
S. S. Lumetta, A. Krishnamurthy, and D. E. Culler, "Towards Modeling the
Performance of a Fast Connected Components Algorithm on Parallel Machines,"
In Proceedings of Supercomputing 1995, San Diego, California,
December 1995.
-
M. J. Litzkow and M. Livny, "Making Workstations a Friendly Environment for
Batch Jobs," In Proceedings of the 3rd Workshop on Workstation Operating
Systems, pp. 62-7, Key Biscayne, Florida, April 1992.
-
J. Laudon and D. E. Lenoski, "The SGI Origin: A ccNUMA Highly Scalable Server,"
In Proceedings of the 24th International Symposium on Computer
Architecture, pp. 241-51, Denver, Colorado, June 1997.
-
D. E. Lenoski, J. Laudon, K. Gharachorloo, A. Gupta, and J. Hennessy, "The
Directory-Based Cache Coherence Protocol for the DASH Multiprocessor," In
Proceedings of the 17th International Symposium on Computer
Architecture, pp. 148-59, Seattle, Washington, May 1990.
-
M. J. Litzkow, M. Livny, and M. W. Mutka, "Condor--A Hunter of Idle
Workstations," In Proceedings of the 8th International Conference on
Distributed Computing Systems, pp. 104-11, San Jose, California, June
1988.
-
S. S. Lumetta, A. M. Mainwaring, and D. E. Culler, "Multi-Protocol Active
Messages on a Cluster of SMP's," In Proceedings of SC97: High Performance
Networking and Computing, San Jose, California, November 1997.
-
N. K. Madsen, "Divergence Preserving Discrete Surface Integral Methods for
Maxwell's Curl Equations Using Non-Orthogonal Unstructured Grids," NASA RIACS
Technical Report 92.04, February 1992.
-
R. P. Martin, "HPAM: an Active Message Layer for a Network of HP Workstations,"
In Proceedings of Hot Interconnects II, pp. 40-58, August 1994.
-
A. M. Mainwaring and D. E. Culler, "Active Message Applications Programming
Interface and Communication Subsystem Organization," U. C. Berkeley Technical
Report #CSD-96-918, October 1996.
-
J. M. Mellor-Crummey and M. L. Scott, "Algorithms for Scalable Synchronization
on Shared-Memory Multiprocessors," ACM Transactions on Computer
Systems, 9(1):21-65, Feb. 1991.
-
A. M. Mainwaring, B. N. Chun, S. Schleimer, and D. Wilkerson, "System Area
Network Mapping," In Proceedings of the 9th Symposium on Parallel
Algorithms and Architectures, pp. 116-26, Newport, Rhode Island, May
1997.
-
S. S. Mukherjee and M. D. Hill, "A Case for Making Network Interfaces Less
Peripheral," In Proceedings of Hot Interconnects V, pp. 21-6,
Stanford, California, August 1997.
-
H. Massalin and C. Pu, "A Lock-free Multiprocessor OS Kernel," Columbia
University Technical Report CUCS-005-91, June 1991.
-
J. E. Moreira and C. D. Polychronopoulos, "Autoscheduling in a Distributed
Shared-Memory Environment," In K.~Pingali~et al., editor,
Proceedings of the 7th International Workshop on Languages and Compilers
for Parallel Computing, Vol. 892 of Lecture Notes in Computer
Science, pp. 453-69, Springer-Verlag, 1994.
-
"MPI-2: Extensions to the Message-Passing Interface," July 1997.
(available from the Message Passing
Interface Forum)
-
M. M. Michael and M. L. Scott, "Simple, Fast, and Practical Non-Blocking and
Blocking Concurrent Queue Algorithms," In Proceedings of the 15th
Symposium on Principles of Distributed Computing, pp. 267-75,
Philadelphia, Pennsylvania, May 1996.
-
M. M. Michael and M. L. Scott, "Relative Performance of Preemption-Safe Locking
and Non-Blocking Synchronization on Multiprogrammed Shared Memory
Multiprocessors," In Proceedings of the 11th International Parallel
Processing Symposium, pp. 267-73, Geneva, Switzerland, April 1997.
-
R. P. Martin, A. M. Vahdat, D. E. Culler, and T. E. Anderson, "Effects of
Communication Latency, Overhead, and Bandwidth in a Cluster Architecture," In
Proceedings of the 24th International Symposium on Computer
Architecture, pp. 85-97, Denver, Colorado, June 1997.
-
G. Narlikar and G. Blelloch, "Pthreads for Dynamic and Irregular Parallelism,"
In Proceedings of SC98: High Performance Networking and Computing,
Orlando, Florida, November 1998.
-
J. M. Nick, B. B. Moore, J.-Y. Chung, and N. S. Bowen, "S/390 Cluster
Technology: Parallel Sysplex," IBM Systems Journal, 36(2):172-202,
1997.
-
J. K. Ousterhout, "Scheduling Techniques for Concurrent Systems," In
Proceedings of the 3rd International Conference on Distributed Computing
Systems, pp. 22-30, Miami/Fort Lauderdale, Florida, October 1982.
-
D. A. Patterson, T. E. Anderson, N. Cardwell, R. Fromm, K. Keeton, C.
Kozyrakis, R. Thomas, and K. Yelick, "A Case for Intelligent RAM: IRAM,"
IEEE Micro, 17(2):29-38, April 1997.
-
G. F. Pfister, In Search of Clusters: The Ongoing Battle in Lowly Parallel
Computing. Prentice Hall, 2nd edition, 1998.
-
D. A. Patterson and J. L. Hennessy, Computer Organization and Design: The
Hardware/Software Interface. Morgan Kaufmann, 2nd edition, 1998.
-
S. Pakin, M. Lauria, and A. A. Chien, "High Performance Messaging on
Workstations: Illinois Fast Messages (FM) for Myrinet," In Proceedings of
Supercomputing 1995, San Diego, California, December 1995.
-
M. S. Papamarcos and J. H. Patel, "A Low-Overhead Coherence Solution for
Multiprocessors with Private Cache Memories," In Proceedings of the 11th
International Symposium on Computer Architecture, pp. 348-54, Ann
Arbor, Michigan, June 1984.
-
L. Prylli and B. Tourancheau, "BIP: A New Protocol Designed for High
Performance Networking on Myrinet," In José Rolim, editor, Parallel
and Distributed Processing, Vol. 1388 of Lecture Notes in Computer
Science, pp. 472-85, Springer-Verlag, 1998.
-
W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery,
Numerical Recipes in C: The Art of Scientific Computing. Cambridge
University Press, 2nd edition, January 1993.
-
D. P. Reed and R. K. Kanodia, "Synchronization with Eventcounts and
Sequencers," Communications of the ACM, 22(2):115-23, February
1979.
-
S. K. Reinhardt, J. R. Larus, and D. A. Wood, "Tempest and Typhoon: User-Level
Shared Memory," In Proceedings of the 21st International Symposium on
Computer Architecture, pp. 325-36, Chicago, Illinois, April 1994.
-
L. Rudolph, Software Structures for Ultraparallel Computing, PhD
thesis, New York University, December 1981.
-
R. H. Saavedra, "Micro Benchmark Analysis of the KSR1," In Proceedings of
Supercomputing 1993, pp. 202-13, Portland, Oregon, November 1993.
-
E. Speight and J. K. Bennett, "Brazos: A Third Generation DSM System," In
Proceedings of the USENIX Windows NT Workshop, pp. 95-106, Seattle,
Washington, August 1997.
-
E. Speight and J. K. Bennett, "Using Multicast and Multithreading to Reduce
Communication in Software DSM Systems," In Proceedings of the 4th
International Symposium on High-Performance Computer Architecture, pp.
312-22, Las Vegas, Nevada, February 1998.
-
A. Singhal, D. Broniarczyk, F. Cerauskis, J. Price, L. Yuan, C. Cheng, D.
Doblar, S. Fosth, N. Agarwal, K. Harvey, E. Hagersten, and B. Liencres,
"Gigaplane: A High Performance Bus for Large SMPs," In Proceedings of Hot
Interconnects IV, pp. 41-52, Stanford, California, August 1996.
-
P. Stenström, M. Brorsson, and L. Sandberg, "An Adaptive Cache Coherence
Protocol Optimized for Migratory Sharing," In Proceedings of the 20th
International Symposium on Computer Architecture, pp. 109-18, San
Diego, California, May 1993.
-
M. L. Scott and A. L. Cox, "An Empirical Study of Message-Passing Overhead," In
Proceedings of the 7th International Conference on Distributed Computing
Systems, pp. 536-43, Berlin, West Germany, September 1987.
-
R. Stets, S. Dwarkadas, N. Hardavellas, G. Hunt, L. Kontothanassis, S.
Parthasarathy, and M. Scott, "Cashmere-2L: Software Coherent Shared Memory on
a Clustered Remote-Write Network," In Proceedings of the 16th Symposium
on Operating Systems Principles, pp. 170-83, Saint Malo, France,
October 1997.
-
D. J. Scales, K. Gharachorloo, and A. Aggarwal, "Fine-Grain Software
Distributed Shared Memory on SMP Clusters," In Proceedings of the 4th
International Symposium on High-Performance Computer Architecture, pp.
125-36, Las Vegas, Nevada, February 1998.
Also available as DEC WRL Research Report 97/3.
-
D. J. Scales, K. Gharachorloo, and C. Thekkath, "Shasta: A Low Overhead,
Software-Only Approach for Supporting Fine-Grain Shared Memory," In
Proceedings of the 7th International Conference on Architectural Support
for Programming Languages and Operating Systems, pp. 174-85, Cambridge,
Massachusetts, October 1996.
Also available as DEC WRL Research Report 96/2.
-
K. E. Schauser and C. Scheiman, "Experience with Active Messages on the Meiko
CS-2," In Proceedings of the 9th International Parallel Processing
Symposium, pp. 140-9, Santa Barbara, California, April 1995.
-
D. Teodosiu, J. Baxter, K. Govil, J. Chapin, M. Rosenblum, and M. Horowitz,
"Hardware Fault Containment in Scalable Shared-Memory Multiprocessors," In
Proceedings of the 24th International Symposium on Computer
Architecture, pp. 73-84, Denver, Colorado, June 1997.
-
L. Tucker and A. M. Mainwaring, "CMMD: Active Messages on the CM-5,"
Parallel Computing, 20(4):481-96, August 1994.
-
L. G. Valiant, "A Bridging Model for Parallel Computation," Communications
of the ACM, 33(8):103-11, August 1990.
-
J. D. Valois, "Implementing Lock-Free Queues," In Proceedings of 7th
International Conference on Parallel and Distributed Computing Systems,
pp. 64-9, Las Vegas, Nevada, October 1994.
-
T. von Eicken, V. Avula, A. Basu, and V. Buch, "Low-latency Communication over
ATM Networks Using Active Messages," IEEE Micro, 15(1):46-53,
February 1995.
-
T. von Eicken, D. E. Culler, S. C. Goldstein, and K. E. Schauser, "Active
Messages: a Mechanism for Integrated Communication and Computation," In
Proceedings of the 19th International Symposium on Computer
Architecture, pp. 256-66, Gold Coast, Queensland, Australia, May 1992.
-
"Virtual Interface Architecture Specification, Version 1.0, published by
Compaq, Intel, and Microsoft," December 1997.
(available online)
-
M. D. Welsh, A. Basu, and T. von Eicken, "ATM and Fast Ethernet Network
Interfaces for User-Level Communication," In Proceedings of 3rd
International Symposium on High-Performance Computer Architecture, pp.
332-42, San Antonio, Texas, February 1997.
-
R. W. Wisniewski, L. I. Kontothanassis, and M. L. Scott, "High Performance
Synchronization Algorithms for Multiprogrammed Multiprocessors," In
Proceedings of the 5th Symposium on Principles and Practice of Parallel
Programming, pp. 199-206, Santa Barbara, California, July 1995.
-
P. R. Woodward, "Perspectives on Supercomputing: Three Decades of Change,"
IEEE Computer, 29:99-111, October 1996.
-
D. Yeung, Multigrain Shared Memory, PhD thesis, Massachusetts
Institute of Technology, February 1998.
-
D. Yeung, J. Kubiatowicz, and A. Agarwal, "MGS: A Multigrain Shared Memory
System," In Proceedings of the 23rd International Symposium on Computer
Architecture, pp. 44-55, Philadelphia, Pennsylvania, May 1996.
-
K. Yelick, L. Semenzato, G. Pike, C. Miyamoto, B. Liblit, A. Krishnamurthy, P.
Hilfinger, S. Graham, D. Gay, P. Colella, and A. Aiken, "Titanium: A
High-Performance Java Dialect," In Proceedings of the Workshop on Java
for High-Performance Network Computing, Stanford, California, February
1998.
-
M. Young, A. Tevanian, R. Rashid, D. Golub, J. Eppinger, J. Chew, W. Bolosky,
D. Black, and R. Baron, "The Duality of Memory and Communication in the
Implementation of a Multiprocessor Operating System," In Proceedings of
the 11th Symposium on Operating Systems Principles, pp. 63-76, Austin,
Texas, November 1987.
Return to Steve's bibliography page.
Return to Steve's home page.