LLM Analysis¶

Latency and memory analysis of Large Language Models (LLMs) / Transformer Models for training and inference.

Analysis¶

  • Analysis
    • ActivationRecomputation
    • DSZeRO
    • DtypeConfig
    • Enum
    • GPUConfig
    • LLMAnalysis
    • ModelConfig
    • ParallelismConfig
    • get_dtype_config_by_name()
    • get_gpu_config_by_name()
    • get_model_config_by_name()
    • infer()
    • pformat()
    • total_ordering()
    • train()
    • within_range()

Configurations¶

  • Configurations
    • DtypeConfig
    • EnhancedJSONEncoder
    • GPUConfig
    • ModelConfig
    • ParallelismConfig
    • dump_configs()
    • dump_hf_model_configs_by_type_and_task()
    • dump_model_config_by_name()
    • get_dtype_config_by_name()
    • get_gpu_config_by_name()
    • get_hf_models_by_type_and_task()
    • get_model_config_by_name()
    • get_model_config_from_hf()
    • list_dtype_configs()
    • list_gpu_configs()
    • list_model_configs()
    • populate_model_and_gpu_configs()
    • read_configs()

Constants¶

  • Constants

Indices and tables¶

  • Index

  • Module Index

  • Search Page

llm-analysis

Navigation

  • Analysis
  • Configurations
  • Constants

Related Topics

  • Documentation overview
    • Next: Analysis

Quick search

©2023, Cheng Li. | Powered by Sphinx 7.1.2 & Alabaster 0.7.13 | Page source