Back to SimpleMetrics page.

PseudoPerplexityMetric

General description

Calculate the pseudo-perplexity from a PerResidueProbabilitiesMetric, which is a value defining the likelihood of a sequence given the prediction of a model (smaller is better). It is defined as the exponentiation of the average negative logarithm of the predicted probabilities.

Autogenerated Tag Syntax Documentation:


A metric for estimating the likeliness of a sequence given some predicted probabilities.

References and author information for the PseudoPerplexityMetric simple metric:

PseudoPerplexityMetric SimpleMetric's author(s): Moritz Ertelt, University of Leipzig [moritz.ertelt@gmail.com]

<PseudoPerplexityMetric name="(&string;)" custom_type="(&string;)"
        metric="(&string;)" use_cached_data="(false &bool;)"
        cache_prefix="(&string;)" cache_suffix="(&string;)"
        fail_on_missing_cache="(true &bool;)" />
  • custom_type: Allows multiple configured SimpleMetrics of a single type to be called in a single RunSimpleMetrics and SimpleMetricFeatures. The custom_type name will be added to the data tag in the scorefile or features database.
  • metric: (REQUIRED) A PerResidueProbabilitiesMetric to calculate the pseudo-perplexity from.
  • use_cached_data: Use any data stored in the datacache that matches the set metrics name (and any prefix/suffix.) Data is stored during a SimpleMetric's apply function, which is called during RunSimpleMetrics
  • cache_prefix: Any prefix used during apply (RunSimpleMetrics), that we will match on if use_cache is true
  • cache_suffix: Any suffix used during apply (RunSimpleMetrics), that we will match on if use_cache is true
  • fail_on_missing_cache: If use_cached_data is True and cache is not found, should we fail?

Example

<ROSETTASCRIPTS>
    <RESIDUE_SELECTORS>
        <Chain name="res" chains="A" />
        <Index name="mask" resnums="25"/>
    </RESIDUE_SELECTORS>
    <SIMPLE_METRICS>
        <PerResidueEsmProbabilitiesMetric name="prediction" residue_selector="res" attention_mask_selection="mask"  write_pssm="test.pssm" model="esm2_t6_8M_UR50D" multirun="true"/>
        <PseudoPerplexityMetric name="perplex" metric="prediction"/>
    </SIMPLE_METRICS>
    <FILTERS>
    </FILTERS>
    <MOVERS>
        <RunSimpleMetrics name="run" metrics="perplex"/>
    </MOVERS>
    <PROTOCOLS>
        <Add mover_name="run"/>
    </PROTOCOLS>
</ROSETTASCRIPTS>

Reference

This is currently unpublished.

See Also