
Search with Context Similarity #2

Draft: wants to merge 63 commits into base: development
Conversation

@sshivaditya2019 (Collaborator) commented Oct 5, 2024

Resolves #50

  • Database backfilling with the issue and comment data.
  • Builds on the existing open PR "@ubiquityos gpt command" #1.
  • New adapters for voyageai and supabase.
  • Updated prompt for the OpenAI completions.
  • Added rerankers for reranking the similar-search results.
  • Similarity search functions for the DB.
  • QA (testing).
  • QA (multiple models).
  • Improve the data quality.
  • Optimize the reranking and retrieval process.
  • Optimize the existing issue retrieval and formatting.
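The reranking and similarity-search steps above boil down to scoring candidate embeddings against a query embedding. A minimal, self-contained sketch of that scoring (the PR itself uses Voyage AI rerankers and Supabase, which this does not reproduce):

```typescript
// Sketch: cosine-similarity scoring of the kind used for similarity search
// and reranking. Pure illustration, not the PR's actual implementation.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

// Rank candidate embeddings by similarity to a query embedding,
// returning candidate indices, best match first.
function rank(query: number[], candidates: number[][]): number[] {
  return candidates
    .map((c, i) => ({ i, score: cosine(query, c) }))
    .sort((x, y) => y.score - x.score)
    .map((x) => x.i);
}

console.log(rank([1, 0], [[0, 1], [1, 0.1]])); // [1, 0]
```

In practice the database (e.g. a vector index) does this scoring server-side; a reranker then reorders only the top-k hits with a stronger model.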

Results of the database backfilling:

  • A total of 146 issues were identified.
  • A total of 1,238 comments were collected, including comments from pull requests (PRs), PR reviews, and comments on the identified issues.
  • Embeddings were generated with Voyage AI for enhanced data analysis.
  • The data was then converted into CSV format and loaded into Supabase for further use.
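The "convert to CSV and load into Supabase" step might look roughly like this sketch (the column set and escaping rules here are illustrative assumptions, not taken from the PR):

```typescript
// Sketch: turn embedded comments into CSV rows for a Supabase bulk load.
// The column set (id, body, embedding) is a hypothetical example.
type EmbeddedComment = { id: number; body: string; embedding: number[] };

function toCsv(rows: EmbeddedComment[]): string {
  // RFC 4180-style escaping: wrap in quotes, double any embedded quotes.
  const escape = (s: string) => `"${s.replace(/"/g, '""')}"`;
  const header = "id,body,embedding";
  const lines = rows.map((r) =>
    [r.id, escape(r.body), escape(JSON.stringify(r.embedding))].join(",")
  );
  return [header, ...lines].join("\n");
}

const csv = toCsv([{ id: 1, body: 'says "hi"', embedding: [0.1, 0.2] }]);
console.log(csv.split("\n").length); // 2
```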

@sshivaditya2019 (Collaborator, Author):

QA: Issue

@sshivaditya2019 (Collaborator, Author):

I tested a few models; Claude 3.5 Sonnet and OpenAI GPT-4o performed the best. The other models hallucinated even with a very low temperature and a top_p value of 0.5 wherever possible.

@sshivaditya2019 (Collaborator, Author):

@0x4007 Could you please check the model responses? And are there any questions that could judge the retrieval performance on topics that are discussed very rarely or only once?

@0x4007 (Member) commented Oct 6, 2024

> @0x4007 Could you please check the model responses? And are there any questions that could judge the retrieval performance on topics that are discussed very rarely or only once?

Gold star? Non established. Will need to work on this asap.

DM me we can collaborate on this.

"description": "Ubiquibot plugin template repository with TypeScript support.",
"author": "Ubiquity DAO",
"description": "A highly context aware organization integrated chatbot",
"author": "Ubiquity OS",
Member:

Suggested change:
- "author": "Ubiquity OS",
+ "author": "Ubiquity DAO",
  • DAO is the organization.
  • OS is the software.
  • DevPool is the community.

repo: repo || payload.repository.name,
issue_number: issueNum || payload.issue.number,
})
.then(({ data }) => data as Issue);
Member:

Pretty unusual syntax to mix async await and then
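For illustration, a minimal self-contained contrast between the flagged mixed style and plain await (the `fetchRaw` helper and `Issue` shape here are hypothetical stand-ins, not the PR's actual `fetchIssue`):

```typescript
// Hypothetical stand-ins for the PR's types and Octokit call.
type Issue = { number: number; title: string };

async function fetchRaw(): Promise<{ data: Issue }> {
  return { data: { number: 1, title: "example" } };
}

// Mixed style, as flagged in the review: awaiting a .then chain.
async function fetchIssueMixed(): Promise<Issue> {
  return await fetchRaw().then(({ data }) => data as Issue);
}

// Plain await: equivalent behavior, easier to read and debug.
async function fetchIssuePlain(): Promise<Issue> {
  const { data } = await fetchRaw();
  return data;
}

fetchIssuePlain().then((i) => console.log(i.title)); // prints "example"
```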


const issue = await fetchIssue(params);

let comments: IssueComments | ReviewComments = [];
Member:

Does it make sense to have two separate arrays for each data type?
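One way to avoid maintaining two parallel arrays is a single array of a tagged union. A sketch (the `kind` and `body` fields are illustrative assumptions, not the actual `IssueComments`/`ReviewComments` shapes):

```typescript
// Sketch: one tagged-union array instead of two separate ones.
// The element shape is hypothetical, for illustration only.
type UnifiedComment =
  | { kind: "issue"; body: string }
  | { kind: "review"; body: string };

const comments: UnifiedComment[] = [
  { kind: "issue", body: "original report" },
  { kind: "review", body: "left on the diff" },
];

// Filtering on the tag recovers either original list when needed:
const reviewOnly = comments.filter((c) => c.kind === "review");
console.log(reviewOnly.length); // 1
```

The union keeps one code path for formatting and embedding while still letting callers distinguish the two sources.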

export async function runPlugin(context: Context) {
const {
logger,
env: { UBIQUITY_OS_APP_SLUG },
Member:

Should be renamed to

Suggested change:
- env: { UBIQUITY_OS_APP_SLUG },
+ env: { UBIQUITY_OS_APP_NAME },

@sshivaditya2019 (Collaborator, Author):

Model Cost Comparison

This table shows the cost per response for various models, based on an average total of 3,000 tokens (input and output combined).

| Model | Cost per Response | Tokens per Second (TPS) |
| --- | --- | --- |
| GPT-4o | $0.018775 | 68 |
| O1 Mini | $0.0232 | 170 |
| Phi 3.5 Mini | $0.000256 | 42.3 |
| Gemini Flash 1.5 | $0.0000896 | 177 |
| Claude Sonnet 3.5 | $0.0138 | 66 |
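For reference, per-response figures like those above follow from per-million-token rates. A sketch of the arithmetic (the rates in the example are placeholders, not the actual prices behind the table):

```typescript
// Sketch: cost per response from per-million-token rates.
// The example rates ($2.50 input, $10.00 output per million tokens)
// are placeholders, not quoted prices for any model above.
function costPerResponse(
  inputTokens: number,
  outputTokens: number,
  inputPerMillion: number,
  outputPerMillion: number
): number {
  return (
    (inputTokens / 1e6) * inputPerMillion +
    (outputTokens / 1e6) * outputPerMillion
  );
}

// e.g. 2,500 input + 500 output tokens:
console.log(costPerResponse(2500, 500, 2.5, 10.0));
```

This evaluates to $0.01125 for the example rates; output tokens typically dominate the cost because they are priced several times higher than input tokens.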

github-actions bot commented Oct 7, 2024

Unused files (7)

src/handlers/comments.ts, src/helpers/format-chat-history.ts, src/helpers/issue-fetching.ts, src/helpers/issue-handling.ts, src/helpers/issue.ts, src/types/github.ts, src/types/gpt.ts

Unlisted dependencies (5)

| Filename | Unlisted dependency |
| --- | --- |
| src/plugin.ts | @supabase/supabase-js |
| src/adapters/index.ts | @supabase/supabase-js |
| src/adapters/supabase/helpers/comment.ts | @supabase/supabase-js |
| src/adapters/supabase/helpers/supabase.ts | @supabase/supabase-js |
| src/adapters/supabase/helpers/issues.ts | @supabase/supabase-js |

@0x4007 (Member) commented Oct 7, 2024

Seems like we get what we pay for :)

@sshivaditya2019 (Collaborator, Author):

QA:

Can parse linked code files in the issue spec and answer questions based on them.

Code Parse #1
Code Parse #2

@0x4007 (Member) commented Oct 9, 2024

Your QA results are quite interesting. We should tune the prompt to focus on brevity.

Can you display (add an extra comment) which shows the entire passed in context? I would like to audit this.

Once this is set up I would like to try asking a couple questions.

@sshivaditya2019 (Collaborator, Author):

> Your QA results are quite interesting. We should prompt and focus on brevity.
>
> Can you display (add an extra comment) which shows the entire passed in context? I would like to audit this.
>
> Once this is set up I would like to try asking a couple questions.

The plugin is running in the test-public repo; you can try it over there. As for the context, I think it takes in around 2,400 tokens on average.

@sshivaditya2019 (Collaborator, Author):

On average, these responses cost approximately $0.22, based on an input token count of 2,500 and an output token count of 3,300 on the o1-mini model. While these responses are quite expensive, they provide a good overview of the task.

@0x4007 (Member) commented Oct 9, 2024

That's mostly fine. Any price these models charge us is orders of magnitude cheaper than developer time, particularly for those on base pay.

@sshivaditya2019 (Collaborator, Author) commented Oct 9, 2024

I don't have access to the o1-preview model, but I think its responses should be better than o1-mini's. The next step would be parsing pull requests and their review comments. I think this PR should be broken into multiple iterations.

@0x4007 (Member) commented Oct 9, 2024

Actually, we should use mini because it has a much larger usable context length; preview spends a lot more tokens on internal reasoning. I can borrow a key, but as I understand it, both o1 models are available on the same tier of OpenAI account, meaning if you have access to one, you should have access to both.
