nn
Common torch.nn.Module implementations.
Submodules
- nn.attention
  - SlidingWindowAttentionConfig
  - GateGranularity
  - GateConfig
  - AttentionType
  - AttentionBackendName
  - AttentionBackend
  - TorchAttentionBackend
  - FlashAttention2Backend
  - FlashAttention3Backend
  - FlashAttention4Backend
  - TEAttentionBackend
  - AttentionConfig
  - Attention
  - FusedAttention
  - NormalizedAttention
  - RingAttentionLoadBalancerType
  - RingAttentionLoadBalancer
  - RingAttentionZigZagLoadBalancer
  - RingAttentionLlama3LoadBalancer
  - UlyssesLoadBalancer
  - RingContextParallelStyle
  - UlyssesContextParallelStyle
  - GatedDeltaNetConfig
  - GatedDeltaNet
- nn.conversion
- nn.feed_forward
- nn.functional
- nn.hf
  - convert_checkpoint_to_hf()
  - convert_hybrid_state_to_hf()
  - convert_state_from_hf()
  - convert_state_to_hf()
  - get_converter_from_hf()
  - get_converter_to_hf()
  - get_hf_config()
  - get_hybrid_hf_config()
  - get_hybrid_layer_types()
  - is_olmo_hybrid_model()
  - load_config()
  - load_hf_model()
  - save_hf_hybrid_model()
  - save_hf_model()
  - HF_TO_OLMO_CORE_WEIGHT_MAPPINGS
  - HF_TO_OLMO_CORE_MODULE_MAPPINGS
  - MODEL_TYPE_SPECIFIC_HF_TO_OLMO_CORE_WEIGHT_MAPPINGS
  - MODEL_TYPE_SPECIFIC_HF_TO_OLMO_CORE_MODULE_MAPPINGS
  - HF_TO_OLMO_CORE_TEMPLATE_MAPPINGS
  - OLMO_CORE_TO_HF_WEIGHT_MAPPINGS
  - OLMO_CORE_TO_HF_MODULE_MAPPINGS
  - OLMO_CORE_TO_HF_TEMPLATE_MAPPINGS
  - MODEL_TYPE_SPECIFIC_OLMO_CORE_TO_HF_TEMPLATE_MAPPINGS
- nn.layer_norm
- nn.lm_head
- nn.moe
- nn.rope
- nn.transformer
  - TransformerType
  - TransformerConfig
  - Transformer
  - NormalizedTransformer
  - MoETransformer
  - MoEHybridTransformerBlockBase
  - MoEHybridTransformerBlock
  - MoEHybridReorderedNormTransformerBlock
  - TransformerBlockType
  - TransformerBlockConfig
  - TransformerBlockBase
  - TransformerBlock
  - ReorderedNormTransformerBlock
  - LayerNormScaledTransformerBlock
  - PeriNormTransformerBlock
  - NormalizedTransformerBlock
  - MoETransformerBlock
  - MoEReorderedNormTransformerBlock
  - InitMethod