Introduction to Scripthea

Scripthea is a freeware Windows application for text-to-image prompt engineering.

Scripthea is my response to the booming development of text-to-image AI generation domain. It provides a systematic approach in composing the text prompt (aka prompt engineering). Briefly, prompt = cue + modifiers. You will be offered collections of short descriptive texts (cues) and categorized collections of modifiers, like a painter, art style, time period, etc. After composing a prompt you can copy (blue icon) and paste it into your favourite text-to-image generator and see to result in your browser. Alternatively, you can access to some generators (local install of Stable Diffusion for now, more to follow). Scripthea is especially useful for creative souls which speak English as an additional language.

The prompt and the picture become part of your collection (image depot) with a convenient image viewer on the second tab. On top of that, you can also "scan". After selecting some cues and some modifiers Scan will combine them (there are rules). So for example if you would like to see how a particular painter would paint different subjects (cues) or how specific topic would be painted by different painters. Scripthea will generate all the combinations for you (scan) and query the active API for you to put them in an image collection.

See a short video introduction clip HERE or on YouTube.

Read some reviews:

  • Scripthea Makes AI Art Creation Effortless. by Ramakanth (here)
  • Scripthea: Unleashing Your Inner Artist with Text-to-Image AI. by Nayedeals  (here)
  • A comprehensive prompt composer tool to help with image-based generative AI, providing a methodical approach to composing complex prompts with minimal effort. by Robert Condorache  (here)

The software is distributed as a freeware and it is an open source project (under MIT license) hosted in GitHub repository.

Experimenting with variety of prompts in a systematic way is the best approach I know to create, improve and stabilize your own style in text-to-image generation.

Here is the prompt composer tab...

On the left, you see the log panel which will text you about any ongoing operations. For prompt composing, there are two modes: Single and Scan. In Single mode, you can use one cue with more than one modifier. In Scan mode, you can select any number of cues and any number of modifiers although each prompt will be combination of one cue + all Fixed modifiers + with a number (modifiers sample number) of Scannable modifiers. Modifiers are divided into switchable (on/off) categories. If you wonder about any modifier, hover over it, there will be a hint for the most of them. If you right-click on any modifier you will be asked to confirm a google search for that modifier. In options, you can specify the image depot folder where the images from your scan (or a single query) will go.

All the options, external and internal sizes and main window position are saved on application closing and retrieved on starting.

...and that is the image depot viewer

The viewer shows a Scripthea image depot (a folder with bunch of images and description.idf file). You can select a image depot folder from the directory browser on the left while the image depot text box rim is highlighted (in navy). Check Viewer page for more details about the directory browser. You can chose between table view and thumbnail (grid) view. In the grid view you can adjust the thumbnails from the menu (bottom left button). You can move around with the arrows on the bottom, all self-explanatory (I think). The only other than viewing operation you can do here is delete an image. On the very bottom common (for both views) there is the find panel which will find a word(s) in the prompts of the active image depot and select it. Mark button will highlight the some of the prompt/images by the same criterion. The shown image itself can be zoomed in/out (buttons), panned (scroll-bars) or fit (the middle button), more tools are comming...

Image Depot Master

Image Depot Master (IDM) is image depot manager for copying and moving images from image depot to another or an empty one, as well deleting images from image depot.

It provides an option to validate image depot consistency as erase entries in description.idf without corresponding images. More complete that validating is synchronizing (three bar menu) which is validating and deleting all the images in the folder without entries.

The selection of a folder (image depot or an empty one) is done the same way as in image viewer, as well as two possible views - list and grid, similar to the viewer arrangement.

The idea of two panels to deal with files is coming from old Norton commander DOS file manager.

Import/Export utilities

The forth tab of Scripthea contains an import utility of converting image files from some some text-to-image generator (e.g. Stable Diffusion, Craiyon). The import utility will convert these images into Scripthea image depot. The description.idf file is a text file where each line is json formatted property dictionary of the generated images including the prompt. You can edit the file for any reason as you like as long as you keep the json structure. Export utility takes an image depot and exports selected subset to another folder with export control of files name and type (.png or .jpg). Optionally Scripthea can create an webpage with the exported images for local browsing or your website.

contact

Keep in mind that the application is under active development so I would appreciate any bug report. Let me know HERE and I'll do my best to fix it ASAP. In the same way, you can communicate any ideas for improvement, experiences with the software or your willingness to help me with the project.
I would especially appreciate more cue collections preferably organized by subject, see cues folder for *.cues files.

legal

Scripthea software has been written by and is copyrighted to Teodor Krastev. The sources are distributed under MIT's open source license.

the name "Scripthea"

The name Scripthea (or Scrip(t)Thea) is coming from script (written text, from Latin scriptum) and Theathe Greek goddess of sight and vision.