Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/320080.320119acmconferencesArticle/Chapter ViewAbstractPublication PagesmicroConference Proceedingsconference-collections
Article
Free Access

Selective cache ways: on-demand cache resource allocation

Authors Info & Claims
Published:16 November 1999Publication History

ABSTRACT

Increasing levels of microprocessor power dissipation call for new approaches at the architectural level that save energy by better matching of on-chip resources to application requirements. Selective cache ways provides the ability to disable a subset of the ways in a set associative cache during periods of modest cache activity, while the full cache may remain operational for more cache-intensive periods. Because this approach leverages the subarray partitioning that is already present for performance reasons, only minor changes to a conventional cache are required, and therefore, full-speed cache operation can be maintained. Furthermore, the tradeoff between performance and energy is flexible, and can be dynamically tailored to meet changing application and machine environmental conditions. We show that trading off a small performance degradation for energy savings can produce a significant reduction in cache energy dissipation using this approach.

References

  1. 1.D. Albonesi. Dynamic IPC/clock rate optimization. Proceedings of the 25th International Symposium on Computer Architecture, pages 282-292, June 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.G. Ammons, T. Ball, and J. Larus. Exploiting hardware performance counters with flow and context sensitive profiling. Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, June 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.J. Anderson et al. Continuous profiling: Where have all the cycles gone? Proceedings of the 16th Symposium on Operating Systems Principles, October 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4.A. Argawal, J. Hennessy, and M. Horowitz. Cache performance of operating system and multiprogramming workloads. ACM Transactions on Computer Systems, 6(4):393- 431, November 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.P. Bannon. Alpha 21364: A scalable single-chip SMP. Microprocessor Forum, October 1998.Google ScholarGoogle Scholar
  6. 6.N. Bellas et al. Architectural and compiler support for energy reduction in the memory hierarchy of high performance microprocessors. Proceedings of the International Symposium on Low Power Electronics and Design, pages 70-75, August 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.W. Bowhill et al. Circuit implementation of a 300-MHz 64- bit second-generation CMOS Alpha CPLI. Digital Technical Journal, 7(1):100-118, Special Issue 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.D. Burger and T. Austin. The SimpleScalar toolset, version 2.0. Technical Report TR-97-1342, University of Wisconsin-Madison, June 1997.Google ScholarGoogle Scholar
  9. 9.A. Chandrakasan, S. Sheng, and R. Brodersen. Low-power CMOS digital design. IEEE Journal of Solid-State Circuits, 27(4):473-484, April 1992.Google ScholarGoogle ScholarCross RefCross Ref
  10. 10.J. Dean et al. ProfileMe: Hardware support for instructionlevel profiling in out-of-order processors. Proceedings of the 30th International Symposium on Microarchitecture, pages 292-302, December 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.P. Dinda et al. The CMU task parallel program suite. Technical Report CMU-CS-94-131, Carnegie Mellon University, March 1994.Google ScholarGoogle Scholar
  12. 12.D. Dobberpuhl et al. A 200MHz, 64-bit, dual-issue CMOS microprocessor. Digital Technical Journal, 4(4):35-50, Special Issue 1992.Google ScholarGoogle Scholar
  13. 13.J. Edmondson et al. Internal organization of the Alpha 21164, a 300MHz 64-bit quad-issue CMOS RISC microprocessor. Digital Technical Journal, 7(1): 119-135, Special Issue 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14.T. Horel and G. Lauterbach. UltraSPARC III: Designing third-generation 64-bit performance. IEEE Micro, 19(3):73- 85, May/June 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15.M. Kamble and K. Ghose. Analytical energy dissipation models for low power caches. Proceedings of the International Symposium on Low Power Electronics and Design, pages 143-148, August 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16.R. Kessler. The Alpha 21264 microprocessor. IEEE Micro, 19(2):24-36, March/April 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17.R. Kessler, E. McLellan, and D. Webb. The Alpha 21264 microprocessor architecture. International Conference on Computer Design, October 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.J. Kin, M. Gupta, and W. Mangione-Smith. The filter cache: An energy efficient memory structure. Proceedings of the 29th International Symposium on Microarchitecture, pages 184-193, December 199'/. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19.A. Kumar. The HP PA-8000 RISC CPU. IEEE Computer, 17(2):27-32, March 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. 20.A. Lebeck and D. Wood. Cache profiling and the SPEC benchmarks: A case study. IEEE Computer, 27(10):15-26, October 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. 21.G. Lesartre and D. Hunt. PA-8500: The continuing evolution of the PA-8000 family. Proceedings of Compcon, 1997.Google ScholarGoogle Scholar
  22. 22.E. McLellan. The Alpha AXP architecture and 21064 processor. IEEE Micro, 13(4):36-47, June 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. 23.J. Montanaro et al. A 160-MHz, 32-b, 0.5W CMOS RISC microprocessor. Digital Technical Journal, 9(1):49-62, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. 24.M. Tremblay and J. O'Connor. UltraSparc l: A four-issue processor supporting multimedia. IEEE Micro, 16(2):42- 50, April 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. 25.S. Wilton and N. Jouppi. An enhanced access and cycle time model for on-chip caches. Technical Report 93/5, Digital Western Research Laboratory, July 1994.Google ScholarGoogle Scholar
  26. 26.X. Zhang et al. System support for automatic profiling and optimization. Proceedings of the 16th Symposium on Operating Systems Principles, October 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Selective cache ways: on-demand cache resource allocation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        MICRO 32: Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture
        November 1999
        299 pages
        ISBN:076950437X

        Copyright © Copyright (c) 1998 Institute of Electrical and Electronics Engineers, Inc. All rights reserved.

        Publisher

        IEEE Computer Society

        United States

        Publication History

        • Published: 16 November 1999

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        MICRO 32 Paper Acceptance Rate27of131submissions,21%Overall Acceptance Rate484of2,242submissions,22%

        Upcoming Conference

        MICRO '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader