OpenPipe-llm

Author	SHA1	Message	Date
arcticfly	a180b5bef2	Show prompt diff when changing models (#76 ) * Make CompareFunctions more configurable * Change RefinePromptModal styles * Accept newModel in getModifiedPromptFn * Show prompt comparison in SelectModelModal * Pass variant to SelectModelModal * Update instructions * Properly use isDisabled	2023-07-20 23:26:49 -07:00
Kyle Corbitt	55c697223e	Merge pull request #74 from OpenPipe/model-providers replicate/llama2 provider	2023-07-20 23:21:42 -07:00
arcticfly	9978075867	Fix auth flicker (#75 ) * Remove experiments flicker for unauthenticated users * Decrease size of NewScenarioButton spinner	2023-07-20 20:46:31 -07:00
Kyle Corbitt	847753c32b	replicate/llama2 provider Still need to fix the types but it runs	2023-07-20 19:55:03 -07:00
Kyle Corbitt	372c2512c9	Merge pull request #73 from OpenPipe/model-providers More work on modelProviders	2023-07-20 18:56:14 -07:00
Kyle Corbitt	332a2101c0	More work on modelProviders I think everything that's OpenAI-specific is inside modelProviders at this point, so we can get started adding more providers.	2023-07-20 18:54:26 -07:00
arcticfly	1822fe198e	Initially render AutoResizeTextArea without overflow (#72 ) * Rerender resized text area with scroll * Remove default hidden overflow	2023-07-20 15:00:09 -07:00
Kyle Corbitt	f06e1db3db	Merge pull request #71 from OpenPipe/model-providers Prep for more model providers	2023-07-20 14:55:31 -07:00
Kyle Corbitt	ded6678e97	Prep for more model providers Adds a `modelProvider` field to `promptVariants`, currently just set to "openai/ChatCompletion" for all variants for now. Adds a `modelProviders/` directory where we can define and store pluggable model providers. Currently just OpenAI. Not everything is pluggable yet -- notably the code to actually generate completions hasn't been migrated to this setup yet. Does a lot of work to get the types working. Prompts are now defined with a function `definePrompt(modelProvider, config)` instead of `prompt = config`. Added a script to migrate old prompt definitions. This is still partial work, but the diff is large enough that I want to get it in. I don't think anything is broken but I haven't tested thoroughly.	2023-07-20 14:49:22 -07:00
arcticfly	9314a86857	Use translation in initial scenarios (#70 )	2023-07-20 14:28:48 -07:00
David Corbitt	54dcb4a567	Prevent text input labels from overlaying scenarios header	2023-07-20 14:28:36 -07:00
David Corbitt	2c8c8d07cf	Merge branch 'main' of github.com:corbt/prompt-lab	2023-07-20 13:38:58 -07:00
David Corbitt	e885bdd365	Fix ScenarioEditor padding	2023-07-20 13:38:46 -07:00
arcticfly	86dc36a656	Improve refinement (#69 ) * Format construction function on return * Add more refinement examples * Treat 503 like 429 * Define prompt as object * Fix prettier	2023-07-20 13:05:27 -07:00
arcticfly	55c077d604	Create FloatingLabelInput for scenario variables (#68 ) * Create FloatingLabelInput * Fix prettier * Simplify changes	2023-07-20 12:20:12 -07:00
arcticfly	e598e454d0	Add new predefined refinement options (#67 ) * Add new predefined refinement options * Fix prettier * Add icon to SelectModelModal title	2023-07-19 20:10:08 -07:00
David Corbitt	6e3f90cd2f	Add more info to refinement	2023-07-19 18:10:23 -07:00
David Corbitt	eec894e101	Allow multiline instructions	2023-07-19 18:10:04 -07:00
David Corbitt	f797fc3fa4	Eliminate spinner flicker in OutputCell	2023-07-19 18:09:47 -07:00
David Corbitt	335dc0357f	Fix CompareFunctions for mobile	2023-07-19 17:24:19 -07:00
arcticfly	e6e2c706c2	Change up refinement UI (#66 ) * Remove unused ScenarioVariantCell fields * Refine deriveNewConstructFn * Fix prettier * Remove migration script * Add refine modal * Fix prettier * Fix diff checker overflow * Decrease diff height * Add more context to prompt refining * Auto-expand prompt when refining	2023-07-19 17:19:45 -07:00
Kyle Corbitt	7d2166b305	Merge pull request #65 from OpenPipe/no-model Cache cost on ModelOutput	2023-07-19 16:22:35 -07:00
Kyle Corbitt	60765e51ac	Remove model from promptVariant and add cost Storing the model on promptVariant is problematic because it isn't always in sync with the actual prompt definition. I'm removing it for now to see if we can get away with that -- might have to add it back in later if this causes trouble. Added `cost` to modelOutput as well so we can cache that, which is important given that the cost calculations won't be the same between different API providers.	2023-07-19 16:20:53 -07:00
arcticfly	2c4ba6eb9b	Update README.md (#64 )	2023-07-19 15:39:21 -07:00
arcticfly	4c97b9f147	Refine prompt (#63 ) * Remove unused ScenarioVariantCell fields * Refine deriveNewConstructFn * Fix prettier * Remove migration script * Add refine modal * Fix prettier * Fix diff checker overflow * Decrease diff height	2023-07-19 15:31:40 -07:00
arcticfly	58892d8b63	Remove unused fields, refine model translation (#62 ) * Remove unused ScenarioVariantCell fields * Refine deriveNewConstructFn * Fix prettier	2023-07-19 13:59:11 -07:00
Kyle Corbitt	4fa2dffbcb	styling tweaks for SelectModelModal	2023-07-19 07:17:56 -07:00
Kyle Corbitt	654f8c7cf2	Merge pull request #61 from OpenPipe/experiment-page More visual tweaks	2023-07-19 06:56:58 -07:00
Kyle Corbitt	d02482468d	more visual tweaks	2023-07-19 06:54:07 -07:00
Kyle Corbitt	5c6ed22f1d	Merge pull request #60 from OpenPipe/experiment-page experiment page visual tweaks	2023-07-18 22:26:05 -07:00
Kyle Corbitt	2cb623f332	experiment page visual tweaks	2023-07-18 22:22:58 -07:00
Kyle Corbitt	1c1cefe286	Merge pull request #59 from OpenPipe/auth User accounts	2023-07-18 21:21:46 -07:00
Kyle Corbitt	b4aa95edca	sidebar mobile styles	2023-07-18 21:19:06 -07:00
Kyle Corbitt	1dcdba04a6	User accounts Allows for the creation of user accounts. A few notes on the specifics: - Experiments are the main access control objects. If you can view an experiment, you can view all its prompts/scenarios/evals. If you can edit it, you can edit or delete all of those as well. - Experiments are owned by Organizations in the database. Organizations can have multiple members and members can have roles of ADMIN, MEMBER or VIEWER. - Organizations can either be "personal" or general. Each user has a "personal" organization created as soon as they try to create an experiment. There's currently no UI support for creating general orgs or adding users to them; they're just in the database to future-proof all the ACL logic. - You can require that a user is signed-in to see a route using the `protectedProcedure` helper. When you use `protectedProcedure`, you also have to call `ctx.markAccessControlRun()` (or delegate to a function that does it for you; see accessControl.ts). This is to remind us to actually check for access control when we define a new endpoint.	2023-07-18 21:19:03 -07:00
arcticfly	e0e64c4207	Allow user to create a version of their current prompt with a new model (#58 ) * Add dropdown header for model switching * Allow variant duplication * Fix prettier * Use env variable to restrict prisma logs * Fix env.mjs * Remove unnecessary scroll bar from function call output * Properly record when 404 error occurs in queryLLM task * Add SelectedModelInfo in SelectModelModal * Add react-select * Calculate new prompt after switching model * Send newly selected model with creation request * Get new prompt construction function back from GPT-4 * Fix prettier * Fix prettier	2023-07-18 18:24:04 -07:00
arcticfly	fa5b1ab1c5	Allow user to duplicate prompt (#57 ) * Add dropdown header for model switching * Allow variant duplication * Fix prettier	2023-07-18 13:49:33 -07:00
David Corbitt	999a4c08fa	Fix lint and prettier	2023-07-18 11:11:20 -07:00
arcticfly	374d0237ee	Escape characters in Regex evaluations, minor UI fixes (#56 ) * Fix ScenariosHeader stickiness * Move meta tag from _app.tsx to _document.tsx * Show spinner when saving variant * Escape quotes and regex in evaluations	2023-07-18 11:07:04 -07:00
David Corbitt	b1f873623d	Invalidate prompt stats after cell refetch	2023-07-18 09:45:11 -07:00
arcticfly	4131aa67d0	Continue polling VariantStats while LLM retrieval in progress, minor UI fixes (#54 ) * Prevent zoom in on iOS * Expand function return code background to fill cell * Keep OutputStats on far right of cells * Continue polling prompt stats while cells are retrieving from LLM * Add comment to _document.tsx * Fix prettier	2023-07-17 18:04:38 -07:00
Kyle Corbitt	8e7a6d3ae2	Merge pull request #55 from OpenPipe/more-eval Add GPT4 Evals	2023-07-17 18:01:47 -07:00
Kyle Corbitt	7d41e94ca2	cache eval outputs and add gpt4 eval	2023-07-17 17:55:36 -07:00
Kyle Corbitt	011b12abb9	cache output evals	2023-07-17 17:52:30 -07:00
Kyle Corbitt	1ba18015bc	Merge pull request #53 from OpenPipe/more-eval Fix seeds and update eval field names	2023-07-17 14:26:29 -07:00
Kyle Corbitt	54369dba54	Fix seeds and update eval field names	2023-07-17 14:14:20 -07:00
arcticfly	6b84a59372	Properly catch completion errors (#51 )	2023-07-17 10:50:25 -07:00
Kyle Corbitt	8db8aeacd3	Replace function chrome with comment Use a block comment to explain the expected prompt formatting instead of function chrome. The advantage here is that once a user builds a mental model of how OpenPipe works they can just delete the comment, instead of the function chrome sitting around and taking up space in the UI forever.	2023-07-17 10:30:22 -07:00
Kyle Corbitt	64bd71e370	Merge pull request #50 from OpenPipe/remove-default remove the default value for PromptVariant.model	2023-07-14 17:55:38 -07:00
Kyle Corbitt	ca21a7af06	Run checks on main This will (1) make sure that anything we push directly passes CI, and also (2) cache the pnpm store on the main branch, which will make it available to PR runs as well and hopefully speed up CI a bit (see https://stackoverflow.com/a/75250061).``	2023-07-14 17:49:20 -07:00
Kyle Corbitt	3b99b7bd2b	remove the default value for PromptVariant.model We should be explicit about setting the appropriate model so it always matches the constructFn.	2023-07-14 17:43:52 -07:00

1 2 3 4 5 ...

262 Commits