models

Models

MarkovCombineError

Bases: Exception

Exception raised when attempt to combine incompatible model chains.

MarkovEmptyError

Bases: Exception

Raised when attempting to do model comparison and inspection on empty models.

MarkovTextModel

Bases: Model

Stores a compiled markov text model.

Attributes:

Name	Type	Description
`created`	`datetime`	Date and time when the model was created.
`modified`	`datetime`	Date and time when the model was last modified.
`data`	`JSON`	The text model as JSON.

is_ready `property`

is_ready: bool

Flag to indicate if the model is initialized and ready to generate sentences.

aadd_new_corpus_data_to_model `async`

aadd_new_corpus_data_to_model(
    corpus_entries: list[str],
    *,
    char_limit: int | None = None,
    weights: list[float] | None = None
) -> None

Takes a list of new corpus entries and incorporates them into the model. Unlike aupdate_model_from_corpus, this method is additive. This works by first creating a text model based on the new entries, and then uses markovify.combine to add them to the existing text model. Note that this will fail if the stored model is compiled.

Parameters:

Name	Type	Description	Default
`corpus_entries`	`list[str]`	A list of text sentences to add.	required
`char_limit`	`int \| None`	The character limit to use for the new corpus. Use `0` for no limit.	`None`
`weights`	`list[float] \| None`	The weighting to use for combine operation, the first value representing the saved model, and the second representing the new entries.	`None`

Raises:

Type	Description
`MarkovCombineError`	If the stored model is already compiled.
`MarkovEmptyError`	If the new models are empty.

Source code in src/django_markov/models.py

async def aadd_new_corpus_data_to_model(
    self,
    corpus_entries: list[str],
    *,
    char_limit: int | None = None,
    weights: list[float] | None = None,
) -> None:
    """Takes a list of new corpus entries and incorporates them into the model.
    Unlike `aupdate_model_from_corpus`, this method is additive. This works by
    first creating a text model based on the new entries, and then uses
    `markovify.combine` to add them to the existing text model. Note that
    this will fail if the stored model is compiled.

    Args:
        corpus_entries (list[str]): A list of text sentences to add.
        char_limit (int | None): The character limit to use for the new corpus.
            Use `0` for no limit.
        weights (list[float] | None): The weighting to use for combine
            operation, the first value representing the saved model, and the second
            representing the new entries.

    Raises:
        MarkovCombineError: If the stored model is already compiled.
        MarkovEmptyError: If the new models are empty.
    """
    saved_model = self._as_text_model()
    if self.data is None or self.data == "" or saved_model is None:
        # There's no existing model, use update instead.
        return await self.aupdate_model_from_corpus(
            corpus_entries=corpus_entries, char_limit=char_limit
        )
    if char_limit is None:
        char_limit = _get_corpus_char_limit()
    if weights is not None and len(weights) != 2:  # noqa: PLR2004
        msg = "If provided, weights must have exactly two entries!"
        raise ValueError(msg)
    corpus = " ".join(corpus_entries)
    if len(corpus_entries) == 0 or corpus.replace(" ", "") == "":
        msg = "There are no corpus entries to add!"
        raise MarkovEmptyError(msg)
    if saved_model.chain.compiled:
        msg = "Saved model is compiled, cannot combine!"
        raise MarkovCombineError(msg)
    new_model = POSifiedText(corpus, state_size=saved_model.state_size)
    try:
        combined_model = markovify.combine(
            [saved_model, new_model], weights=weights
        )
    except ValueError as ve:  # no cov
        # If markovify raises any other unexpected error.
        msg = f"The following error occurred while combining: {ve}"
        raise MarkovCombineError(msg) from ve
    if (
        combined_model is not None and type(combined_model) is POSifiedText
    ):  # no cov
        self.data = combined_model.to_json()
        await self.asave()

acombine_models `async` `classmethod`

acombine_models(
    models: list[MarkovTextModel],
    *,
    return_type: Literal[
        "model_instance", "text_model"
    ] = "model_instance",
    mode: Literal["strict", "permissive"] = "strict",
    weights: list[float] | None = None
) -> tuple[MarkovTextModel | POSifiedText, int]

Combine multiple MarkovTextModels into a single model.

Models cannot be combined if any of the following is true

They are empty of data.
They are stored in compiled state.
The state size between models is not the same.
The underlying text models are not the same type (if you subclass).
You supply weights, but not the same number as the models to combine or if you use permissive mode.

Parameters:

Name	Type	Description	Default
`models`	`list[MarkovTextModel]`	A list of MarkovTextModel instances to combine.	required
`return_type`	`Literal['model_instance', 'text_model']`	The desired result type.	`'model_instance'`
`mode`	`Literal['strict', 'permissive']`	strict indicates that an exception should be raised if any of the candidate models are incompatible, or if those specific instances should simply be dropped from the operation.	`'strict'`
`weights`	`list[float] \| None`	A list of floats representing the relative weights to put on each source. Optional, but can only be used with mode='strict'.	`None`

Returns:

Type	Description
`tuple[MarkovTextModel \| POSifiedText, int]`	Either a new MarkovTextModel instance persisted to the database or a POSifiedText object to manipulate at a low level, and the total number of models combined.

Raises:

Type	Description
`ValueError`	If any of the parameter combinations is invalid
`MarkovCombineError`	If models are incompatible for combining or a markovify error is raised.

Source code in src/django_markov/models.py

@classmethod
async def acombine_models(
    cls,
    models: list["MarkovTextModel"],
    *,
    return_type: Literal["model_instance", "text_model"] = "model_instance",
    mode: Literal["strict", "permissive"] = "strict",
    weights: list[float] | None = None,
) -> tuple["MarkovTextModel | POSifiedText", int]:
    """Combine multiple MarkovTextModels into a single model.

    Models cannot be combined if any of the following is true:
        - They are empty of data.
        - They are stored in compiled state.
        - The state size between models is not the same.
        - The underlying text models are not the same type (if you subclass).
        - You supply weights, but not the same number as the models to combine
            or if you use permissive mode.

    Args:
        models (list[MarkovTextModel]): A list of MarkovTextModel instances to
            combine.
        return_type (Literal["model_instance", "text_model"]): The desired result
             type.
        mode (Literal["strict", "permissive"]): strict indicates that an exception
            should be raised if any of the candidate models are incompatible, or
            if those specific instances should simply be dropped from the operation.
        weights (list[float] | None): A list of floats representing the relative
            weights to put on each source. Optional, but can only be used with
            mode='strict'.

    Returns:
        Either a new MarkovTextModel instance
            persisted to the database or a POSifiedText object to manipulate at a
            low level, and the total number of models combined.

    Raises:
        ValueError: If any of the parameter combinations is invalid
        MarkovCombineError: If models are incompatible for combining or a markovify
            error is raised.
    """
    # First we check to ensure that the models are combinable.
    empty_models = []
    compiled_models = []
    workable_models = []
    invalid_state_sizes = []
    if mode not in ["strict", "permissive"]:
        msg = f"Invalid mode: {mode}. Must be one of 'strict' or 'permissive'!"
        raise ValueError(msg)
    if weights is not None and mode != "strict":
        msg = "Weights can only be provided if mode is set to strict!"
        raise ValueError(msg)
    if return_type not in ["model_instance", "text_model"]:
        msg = (
            f"Invalid return_type of {return_type} requested. Must be one of "
            "'model_instance' or 'text_model'"
        )
        raise ValueError(msg)
    current_state_size = 0
    for model in models:
        if not model.is_ready:
            empty_models.append(model)
        else:
            tm = model._as_text_model()
            if tm is None:
                empty_models.append(
                    model
                )  # no cov, catchall to make pyright happy.
            else:
                if current_state_size == 0:
                    current_state_size = tm.state_size
                if tm.state_size != current_state_size:
                    invalid_state_sizes.append(model)
                elif tm and tm.chain.compiled:
                    compiled_models.append(model)
                else:
                    workable_models.append(model)
    if mode == "strict":
        if empty_models or compiled_models or invalid_state_sizes:
            msg = f"There are {len(compiled_models)} compiled models, "
            f"{len(invalid_state_sizes)} models with incompatible state sizes, "
            f"and {len(empty_models)} empty models in set!"
            raise MarkovCombineError(msg)
    if len(workable_models) <= 1:
        msg = f"There is only {len(workable_models)}. Cannot combine!"
        raise MarkovCombineError(msg)
    models_combined = len(workable_models)
    try:
        combined_model = markovify.combine(
            models=[m._as_text_model() for m in workable_models], weights=weights
        )
    except ValueError as m_err:
        msg = f"Combining models caused the following error: {m_err}"
        raise MarkovCombineError(msg) from m_err
    if not isinstance(combined_model, POSifiedText):  # no cov
        msg = "Received invalid result from markovify. "
        f"Returned type is {type(combined_model)}"
        raise MarkovCombineError(msg)
    if return_type == "text_model":
        return combined_model, models_combined  # type: ignore
    new_model = await MarkovTextModel.objects.acreate(data=combined_model.to_json())
    return new_model, models_combined

add_new_corpus_data_to_model

add_new_corpus_data_to_model(
    corpus_entries: list[str],
    *,
    char_limit: int | None = None,
    weights: list[float] | None = None
) -> None

Sync wrapper for aadd_new_corpus_data_to_model.

Parameters:

Name	Type	Description	Default
`corpus_entries`	`list[str]`	A list of text sentences to add.	required
`char_limit`	`int \| None`	The character limit to use for the new corpus. Use `0` for no limit.	`None`
`weights`	`list[float] \| None`	The weighting to use for combine operation, the first value representing the saved model, and the second representing the new entries.	`None`

Raises:

Type	Description
`MarkovCombineError`	If the stored model is already compiled.
`MarkovEmptyError`	If the new models are empty.
`ValueError`	If weights are supplied, and they do not have a length of two.

Source code in src/django_markov/models.py

def add_new_corpus_data_to_model(
    self,
    corpus_entries: list[str],
    *,
    char_limit: int | None = None,
    weights: list[float] | None = None,
) -> None:
    """Sync wrapper for `aadd_new_corpus_data_to_model`.

    Args:
        corpus_entries (list[str]): A list of text sentences to add.
        char_limit (int | None): The character limit to use for the new corpus.
            Use `0` for no limit.
        weights (list[float] | None): The weighting to use for combine
            operation, the first value representing the saved model, and the second
            representing the new entries.

    Raises:
        MarkovCombineError: If the stored model is already compiled.
        MarkovEmptyError: If the new models are empty.
        ValueError: If weights are supplied, and they do not have a length of two.
    """
    return async_to_sync(self.aadd_new_corpus_data_to_model)(
        corpus_entries=corpus_entries, char_limit=char_limit, weights=weights
    )

agenerate_sentence `async`

agenerate_sentence(
    char_limit: int = 0, tries: int = 10
) -> str | None

Generates a random sentence within the character limit based on the model.

Parameters:

Name	Type	Description	Default
`char_limit`	`int`	Maximum characters to use. If zero, no limit.	`0`
`tries`	`int`	Number of attempts to make a sentence.	`10`

Returns: str: Random sentence

Source code in src/django_markov/models.py

async def agenerate_sentence(
    self, char_limit: int = 0, tries: int = 10
) -> str | None:
    """Generates a random sentence within the character limit
    based on the model.

    Args:
        char_limit (int): Maximum characters to use. If zero, no limit.
        tries (int): Number of attempts to make a sentence.
    Returns:
        str: Random sentence
    """
    if not self.is_ready or not self._compiled_model:
        return None
    sentence: str | None
    if char_limit > 0:
        sentence = self._compiled_model.make_short_sentence(
            max_chars=char_limit, tries=tries
        )
    else:
        sentence = self._compiled_model.make_sentence(tries=tries)
    # Emit a signal that can be used by other apps for things such as statistics.
    # Right now, pyright doesn't recognize the asend method as valid member of
    # django.dispatch.Signal
    if sentence is not None:
        await sentence_generated.asend(  # type: ignore
            sender=self.__class__,
            instance=self,
            char_limit=char_limit,
            sentence=sentence,
        )
    return sentence

aupdate_model_from_corpus `async`

aupdate_model_from_corpus(
    corpus_entries: list[str],
    *,
    char_limit: int | None = None,
    store_compiled: bool | None = None
) -> None

Takes the a list of entries as the new full corpus and recreates the model, saving it. The corpus must not exceed the char_limit.

Parameters:

Name	Type	Description	Default
`corpus_entries`	`list[str]`	The corpus as a list of text sentences.	required
`char_limit`	`int \| None`	The maximum number of characters to allow in the corpus.	`None`
`store_compiled`	`bool \| None`	Whether to store the model in it's compiled state. If None, defaults to settings.MARKOV_STORE_COMPILED_MODELS or False.	`None`

Raises:

Type	Description
`ValueError`	If the corpus is beyond the maximum character limit.

Source code in src/django_markov/models.py

async def aupdate_model_from_corpus(
    self,
    corpus_entries: list[str],
    *,
    char_limit: int | None = None,
    store_compiled: bool | None = None,
) -> None:
    """Takes the a list of entries as the new full corpus and recreates the model,
    saving it. The corpus must not exceed the char_limit.

    Args:
        corpus_entries (list[str]): The corpus as a list of text sentences.
        char_limit (int | None): The maximum number of characters
            to allow in the corpus.
        store_compiled (bool | None): Whether to store the model in it's compiled
            state. If None, defaults to settings.MARKOV_STORE_COMPILED_MODELS or
            False.

    Raises:
        ValueError: If the corpus is beyond the maximum character limit.
    """
    if not char_limit:
        char_limit = _get_corpus_char_limit()
    if store_compiled is None:
        store_compiled = _get_default_compile_setting()
    corpus = " ".join(corpus_entries)
    if char_limit != 0 and char_limit < len(corpus):
        msg = f"Supplied corpus is over the maximum character limit: {char_limit}"
        raise ValueError(msg)
    updated_model = POSifiedText(corpus, state_size=STATE_SIZE)
    if store_compiled:
        updated_model.compile(inplace=True)
    self.data = updated_model.to_json()
    await self.asave()

combine_models `classmethod`

combine_models(
    models: list[MarkovTextModel],
    *,
    return_type: Literal[
        "model_instance", "text_model"
    ] = "model_instance",
    mode: Literal["strict", "permissive"] = "strict",
    weights: list[float] | None = None
) -> tuple[MarkovTextModel | POSifiedText, int]

Sync wrapper of acombine_models.

Combine multiple MarkovTextModels into a single model.

Models cannot be combined if any of the following is true

They are empty of data.
They are stored in compiled state.
The state size between models is not the same.
The underlying text models are not the same type (if you subclass).
You supply weights, but not the same number as the models to combine or if you use permissive mode.

Parameters:

Name	Type	Description	Default
`models`	`list[MarkovTextModel]`	A list of MarkovTextModel instances to combine.	required
`return_type`	`Literal['model_instance', 'text_model']`	The desired result type.	`'model_instance'`
`mode`	`Literal['strict', 'permissive']`	strict indicates that an exception should be raised if any of the candidate models are incompatible, or if those specific instances should simply be dropped from the operation.	`'strict'`
`weights`	`list[float] \| None`	A list of floats representing the relative weights to put on each source. Optional, but can only be used with mode='strict'.	`None`

Returns:

Type	Description
`tuple[MarkovTextModel \| POSifiedText, int]`	Either a new MarkovTextModel instance persisted to the database or a POSifiedText object to manipulate at a low level, and the total number of models combined.

Raises:

Type	Description
`ValueError`	If any of the parameter combinations is invalid
`MarkovCombineError`	If models are incompatible for combining or a markovify error is raised.

Source code in src/django_markov/models.py

@classmethod
def combine_models(
    cls,
    models: list["MarkovTextModel"],
    *,
    return_type: Literal["model_instance", "text_model"] = "model_instance",
    mode: Literal["strict", "permissive"] = "strict",
    weights: list[float] | None = None,
) -> tuple["MarkovTextModel | POSifiedText", int]:
    """
    Sync wrapper of acombine_models.

    Combine multiple MarkovTextModels into a single model.

    Models cannot be combined if any of the following is true:
        - They are empty of data.
        - They are stored in compiled state.
        - The state size between models is not the same.
        - The underlying text models are not the same type (if you subclass).
        - You supply weights, but not the same number as the models to combine
            or if you use permissive mode.

    Args:
        models (list[MarkovTextModel]): A list of MarkovTextModel instances to
            combine.
        return_type (Literal["model_instance", "text_model"]): The desired result
             type.
        mode (Literal["strict", "permissive"]): strict indicates that an exception
            should be raised if any of the candidate models are incompatible, or
            if those specific instances should simply be dropped from the operation.
        weights (list[float] | None): A list of floats representing the relative
            weights to put on each source. Optional, but can only be used with
            mode='strict'.

    Returns:
        Either a new MarkovTextModel instance
            persisted to the database or a POSifiedText object to manipulate at a
            low level, and the total number of models combined.

    Raises:
        ValueError: If any of the parameter combinations is invalid
        MarkovCombineError: If models are incompatible for combining or a markovify
            error is raised.
    """
    return async_to_sync(cls.acombine_models)(  # no cov
        models=models, return_type=return_type, mode=mode, weights=weights
    )

generate_sentence

generate_sentence(
    char_limit: int = 0, tries: int = 0
) -> str | None

Sync wrapper for agenerate_sentence.

Source code in src/django_markov/models.py

def generate_sentence(self, char_limit: int = 0, tries: int = 0) -> str | None:
    """Sync wrapper for agenerate_sentence."""
    return async_to_sync(self.agenerate_sentence)(
        char_limit=char_limit, tries=tries
    )  # no cov

is_compiled_model

is_compiled_model() -> bool

Checks if the stored data for the text mile is compiled.

Raises:

Type	Description
`MarkovEmptyError`	if the model data is null.

Source code in src/django_markov/models.py

@cached_property
def is_compiled_model(self) -> bool:
    """
    Checks if the stored data for the text mile is compiled.

    Raises:
        MarkovEmptyError: if the model data is null.
    """
    text_model = self._as_text_model()
    if text_model is None:
        msg = "There is not data in this model and it cannot be inspected."
        raise MarkovEmptyError(msg)
    return text_model.chain.compiled

refresh_from_db

refresh_from_db(*args: Any, **kwargs: Any)

Remove the value of the cached properties before refreshing the data.

Source code in src/django_markov/models.py

def refresh_from_db(self, *args: Any, **kwargs: Any):
    """Remove the value of the cached properties before refreshing the data."""
    super().refresh_from_db(*args, **kwargs)
    for prop in self.cached_properties:
        try:
            del self.__dict__[prop]
        except KeyError:  # no cov
            pass

update_model_from_corpus

update_model_from_corpus(
    corpus_entries: list[str],
    *,
    char_limit: int | None = None,
    store_compiled: bool | None = None
) -> None

Sync wrapper for the async version Takes the a list of entries as the new full corpus and recreates the model, saving it. The corpus must not exceed the char_limit.

Parameters:

Name	Type	Description	Default
`corpus_entries`	`list[str]`	The corpus as a list of text sentences.	required
`char_limit`	`int \| None`	The maximum number of characters to allow in the corpus.	`None`
`store_compiled`	`bool \| None`	Whether to store the model in it's compiled state. If None, defaults to settings.MARKOV_STORE_COMPILED_MODELS or False.	`None`

Raises:

Type	Description
`ValueError`	If the corpus is beyond the maximum character limit.

Source code in src/django_markov/models.py

def update_model_from_corpus(
    self,
    corpus_entries: list[str],
    *,
    char_limit: int | None = None,
    store_compiled: bool | None = None,
) -> None:
    """Sync wrapper for the async version
    Takes the a list of entries as the new full corpus and recreates the model,
    saving it. The corpus must not exceed the char_limit.

    Args:
        corpus_entries (list[str]): The corpus as a list of text sentences.
        char_limit (int | None): The maximum number of characters
            to allow in the corpus.
        store_compiled (bool | None): Whether to store the model in it's compiled
            state. If None, defaults to settings.MARKOV_STORE_COMPILED_MODELS or
            False.

    Raises:
        ValueError: If the corpus is beyond the maximum character limit.
    """
    async_to_sync(self.aupdate_model_from_corpus)(  # no cov
        corpus_entries=corpus_entries,
        char_limit=char_limit,
        store_compiled=store_compiled,
    )

models

MarkovCombineError

MarkovEmptyError

MarkovTextModel

is_ready property

aadd_new_corpus_data_to_model async

acombine_models async classmethod

add_new_corpus_data_to_model

agenerate_sentence async

aupdate_model_from_corpus async

combine_models classmethod

generate_sentence

is_compiled_model

refresh_from_db

update_model_from_corpus

is_ready `property`

aadd_new_corpus_data_to_model `async`

acombine_models `async` `classmethod`

agenerate_sentence `async`

aupdate_model_from_corpus `async`

combine_models `classmethod`