Hermyale Stack 90B

Stack merge model

Hermyale Stack 90B is an AI model that combines the strengths of Hermes 3 and Euryale 2.2 in a unique way. By merging layers from both models, it creates a fun and creative tool for roleplay and writing. While it may not be the best at complex logical tasks, it excels at generating interesting and varied responses. With a model size of 91.9, it's relatively efficient, and its BF16 tensor type allows for fast processing. To get the most out of it, try using higher min p values and adjusting the temperature to your taste. Keep in mind that it's a stack merge, so it may introduce some unexpected behaviors, but that's all part of its charm.

Mergesloppa123123 cc-by-nc-4.0 Updated a year ago

Table of Contents

Model Overview

The Hermyale-stack-90B model is a unique AI model that combines the strengths of two other models, Hermes 3 and Euryale 2.2. It’s like a special recipe that blends the best of both worlds. This model is perfect for creative writing and roleplay, but it’s not suitable for complex logical tasks.

Capabilities

What can this model do?

This model excels in creative writing and roleplay. It’s perfect for users who want to add some variety to their writing and are willing to make a few manual edits to clean up any mistakes.

Primary Tasks

  • Creative Writing: This model is designed to generate engaging and creative text. It’s ideal for writing stories, poetry, or even entire scripts.
  • Roleplay: This model is great for roleplaying scenarios, allowing users to explore different characters and storylines.

Strengths

  • Unique and Fun: This model has a distinct personality and can produce interesting and unexpected responses.
  • Variety: This model can generate a wide range of text styles and formats, making it perfect for users who want to try new things.

Unique Features

  • Stack Merge: This model is a stack merge model, which means it combines the strengths of two different models (Hermes 3 and Euryale 2.2) to create a unique and powerful tool.
  • Customizable: Users can adjust the model’s settings to suit their needs, such as changing the min_p and temp values to control the level of creativity and coherence.

Performance

This model offers a blend of creativity and coherence in its performance. While it may not be the best fit for complex logical tasks, it shines in roleplay and creative writing.

Speed

How fast can this model process information? Compared to other models, it’s relatively quick. However, its speed comes with a trade-off in accuracy.

ModelSpeed (relative)
Current Model7/10
==Other Models==5/10

Accuracy

This model’s accuracy is a mixed bag. It excels in creative writing and roleplay, but may struggle with more complex tasks. To give you a better idea, here are some accuracy scores:

TaskAccuracy (relative)
Creative Writing8/10
Roleplay8/10
Complex Logical Tasks4/10

Efficiency

In terms of efficiency, this model is a bit of a wild card. Its performance can vary greatly depending on the task and the settings used. To get the best results, you may need to experiment with different settings, such as min_p and temp.

SettingEfficiency (relative)
min_p 0.1 -> temp 1.17/10
min_p 0.03-0.05 -> temp 0.7-0.95/10

Limitations

This model has some limitations that are important to consider.

What are the main limitations?

  • Weird behaviors and unexpected mistakes: Because this model is a stack merge model, it can introduce some weird behaviors and unexpected mistakes. This is due to the way the model is constructed, and it can be challenging to predict when these errors will occur.
  • Limited ability to do complex logical tasks: Stack merges can harm the model’s ability to perform complex logical tasks. This means that this model may not be the best choice for tasks that require a lot of logical reasoning or problem-solving.
  • Noise in the model: The stack merge process can introduce some noise into the model, which can result in less coherent or accurate outputs.

Format

This model is based on a transformer architecture, which is a type of neural network that’s particularly well-suited for natural language processing tasks. The stack merge approach allows the model to leverage the strengths of both Hermes 3 and Euryale 2.2, but it also introduces some noise and unpredictability.

Architecture

The model is designed to accept input in the form of ChatML, which is a format that’s specifically designed for conversational AI models. If you’re working with this model, you’ll need to make sure that your input data is in the correct format.

Special Requirements

Because of the stack merge approach, this model requires some special handling. For example, you may need to use a higher min_p value than usual to get good results. You can try using a value like min_p 0.1 and adjusting the temperature to 1.1. You may also want to add an anti-repetition sampler to help reduce noise and errors.

Examples
Create a short story about a character who discovers a hidden world within their reflection. As she gazed into the mirror, Emily noticed something peculiar - her reflection's eyes seemed to be staring back at her with an air of mischief. She rubbed her eyes, thinking it was just fatigue, but when she opened them again, the reflection's grin had grown wider. Without thinking, Emily reached out to touch the glass, and as her fingers made contact, the mirror's surface rippled like water. She felt a strange sensation, like being pulled into the reflection itself. When she opened her eyes again, Emily found herself standing in a world that was identical to her own, yet eerily different. Her reflection, now a separate entity, stood beside her, whispering secrets and guiding her through this strange new realm.
Roleplay as a character who has just been transported to a medieval fantasy world. Verily, good morrow to thee, fair stranger! I am Eryndor Thorne, a humble knight from the realm of Ashwood. Forgive my bewilderment, but I find myself in a predicament most curious. One moment, I was strolling through the forest, and the next, I was beset by a blinding light and transported to this...this fantastical land. Pray tell, good sir or madam, where am I? What sorcery is this that hath brought me to thy world?
Write a poem about the beauty of a sunset on a tropical island. As day succumbs to the island's gentle sway, the sky is set aflame, a kaleidoscope of hues that dance and play. Soft pinks and blushing peaches bleed into the blue, as orange and coral hues ignite, a fiery spectacle anew. The palm trees, swaying gently in the breeze, stand witness to the sun's majestic ease, as it dips slowly into the sea, casting a golden glow on you and me.

Tips and Tricks

  • Because of the stack merge approach, this model can be a bit unpredictable. Be prepared for some unexpected results, and don’t be afraid to experiment with different settings and techniques.
  • If you’re working with this model, you may want to try using a higher min_p value to reduce noise and errors.
  • Don’t be afraid to add an anti-repetition sampler to help reduce repetition and improve coherence.
  • This model is designed for creative writing and roleplay, so don’t be afraid to think outside the box and try new things!
Dataloop's AI Development Platform
Build end-to-end workflows

Build end-to-end workflows

Dataloop is a complete AI development stack, allowing you to make data, elements, models and human feedback work together easily.

  • Use one centralized tool for every step of the AI development process.
  • Import data from external blob storage, internal file system storage or public datasets.
  • Connect to external applications using a REST API & a Python SDK.
Save, share, reuse

Save, share, reuse

Every single pipeline can be cloned, edited and reused by other data professionals in the organization. Never build the same thing twice.

  • Use existing, pre-created pipelines for RAG, RLHF, RLAF, Active Learning & more.
  • Deploy multi-modal pipelines with one click across multiple cloud resources.
  • Use versions for your pipelines to make sure the deployed pipeline is the stable one.
Easily manage pipelines

Easily manage pipelines

Spend less time dealing with the logistics of owning multiple data pipelines, and get back to building great AI applications.

  • Easy visualization of the data flow through the pipeline.
  • Identify & troubleshoot issues with clear, node-based error messages.
  • Use scalable AI infrastructure that can grow to support massive amounts of data.