Model name: t5_local

About T5

T5 is a text-to-text transformer model that converts input text to output text. It's based on the Transformer architecture where an encoder processes the input and a decoder then generates the output. It's pre-trained on a large corpus of text data and is capable of performing a wide range of natural language processing tasks, such as chatbots, translation, summarization, and question answering.

Read more about T5 on Wikipedia.

Supported aidb operations

  • encode_text
  • decode_text
  • encode_text_batch
  • decode_text_batch

Supported models

  • t5-small (default)

See more in the Support matrix.

Creating the default model

SELECT aidb.create_model('my_t5_model', 't5_local');

Creating a specific model

SELECT aidb.create_model(
  'another_t5_model',
  't5_local',
  '{"model": "t5-large", "revision": "main"}'::JSONB
)

Model configuration settings

The following configuration settings are available for T5 models:

  • model The T5 model to use. The default is t5-small.
  • revision The revision of the model to use. The default is refs/pr/15.
  • temperature The temperature to use for sampling. The default is 0.0.
  • seed The random seed to use for sampling. The default is 1599222198345926291.
  • max_tokens The maximum number of tokens to generate. The default is 16384.
  • repeat_penalty The repetition penalty to use. The default is 1.1.
  • repeat_last_n The number of tokens to consider for the repetition penalty. The default is 64.

Model credentials

No credentials are required for the T5 model.