A new paper titled "STALL+: Boosting LLM-based Repository-level Code Completion with Static Analysis" shows the benefits of integrating static analysis with LLMs. (https://arxiv.org/abs/2406.10018)
The authors evaluate four key questions:
- How does each static analysis integration strategy perform in LLM-based repository-level code completion? > They found that integrating static analysis in the prompting phase (especially with file-level dependencies) achieves substantially larger improvements than integration in the other phases (see the sketch after this list).
- How do different combinations of integration strategies affect LLM-based repository-level code completion? > Languages that are easier to analyze statically, such as Java, show larger improvements than dynamic languages like Python.
- How do static analysis integration strategies perform when compared or combined with RAG in LLM-based repository-level code completion? > Static analysis and RAG are complementary and together boost overall accuracy.
- What are the online costs of different integration strategies in LLM-based repository-level code completion? > Combining prompting-phase static analysis with RAG is the most cost-effective option.
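As a rough illustration of what prompting-phase integration looks like, here is a minimal Python sketch: resolve a file's local imports with the standard ast module and prepend those dependency files to the completion prompt. The function names and prompt format are my own simplification, not the paper's implementation.

```python
# Minimal sketch of prompting-phase integration: gather file-level
# dependencies with lightweight static analysis and prepend them to the
# completion prompt. Names and prompt shape are illustrative only.
import ast
from pathlib import Path


def file_level_dependencies(file_path: str, repo_root: str) -> list[str]:
    """Resolve local imports of `file_path` to source files inside the repo."""
    tree = ast.parse(Path(file_path).read_text())
    deps = []
    for node in ast.walk(tree):
        if isinstance(node, (ast.Import, ast.ImportFrom)):
            module = getattr(node, "module", None) or node.names[0].name
            candidate = Path(repo_root) / (module.replace(".", "/") + ".py")
            if candidate.exists():
                deps.append(str(candidate))
    return deps


def build_prompt(file_path: str, repo_root: str, unfinished_code: str) -> str:
    """Prepend dependency context so the LLM sees repository-level definitions."""
    context = "\n\n".join(
        f"# File: {dep}\n{Path(dep).read_text()}"
        for dep in file_level_dependencies(file_path, repo_root)
    )
    return f"{context}\n\n# Complete the following code:\n{unfinished_code}"
```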
In my @owasp App Sec keynote last year, I described how static analysis augmented generation (SaAG) can boost the accuracy of LLM-based patches for vulnerability remediation (you can watch the talk here - https://www.youtube.com/watch?v=Cw4-ZnUNVLs).
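For those curious what a SaAG loop can look like in code, below is a simplified sketch: scan a file with a static analyzer, feed the findings into the prompt, and only accept the generated patch if re-scanning it comes back clean. The semgrep invocation shown is one common way to get JSON findings, and call_llm is a placeholder for whatever model client you use; the talk's actual pipeline differs in the details.

```python
# Simplified SaAG sketch: analyzer findings ground the prompt, and the
# patch is verified by re-running the analyzer on the patched file.
import json
import subprocess
from pathlib import Path


def scan(file_path: str) -> list[dict]:
    """Run semgrep with auto rules and return its JSON findings."""
    out = subprocess.run(
        ["semgrep", "--config", "auto", "--json", file_path],
        capture_output=True, text=True, check=False,
    )
    return json.loads(out.stdout).get("results", [])


def saag_patch(file_path: str, call_llm) -> str | None:
    """Ask the LLM for a fix grounded in analyzer findings, then verify it."""
    findings = scan(file_path)
    if not findings:
        return None
    summary = "\n".join(
        f"- line {f['start']['line']}: {f['extra']['message']}" for f in findings
    )
    prompt = (
        "Fix the following vulnerabilities and return the full corrected file.\n"
        f"Findings:\n{summary}\n\nSource:\n{Path(file_path).read_text()}"
    )
    patched = call_llm(prompt)
    candidate = Path(file_path).with_suffix(".patched.py")
    candidate.write_text(patched)
    # Accept the patch only if the analyzer no longer reports findings on it.
    return patched if not scan(str(candidate)) else None
```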
The new Claude 3.5 Sonnet model from Anthropic has been getting good reviews since last night. It is quite good at coding-related tasks. We tried it on the Static Analysis Eval benchmark (patched-codes/static-analysis-eval), which measures the ability of an LLM to fix vulnerabilities. The model scores 59.21%, which is good but not better than other frontier models (like GPT-4, Gemini-1.5, and Llama-3).
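For context on how a fix-the-vulnerability benchmark like this can be scored, here is a rough sketch of an evaluation loop: write each vulnerable sample to disk, ask the model for a patched file, and count the sample as solved if the analyzer no longer flags it. The dataset split and field names below ("source", "file_name") are assumptions about the Hugging Face layout of patched-codes/static-analysis-eval, and call_llm/scan follow the sketch above.

```python
# Rough sketch of scoring a model on a fix-the-vulnerability benchmark:
# the score is the fraction of samples where the analyzer finds nothing
# after the model's patch is applied. Field names are assumptions.
from pathlib import Path

from datasets import load_dataset


def evaluate(call_llm, scan, work_dir: str = "/tmp/sae") -> float:
    ds = load_dataset("patched-codes/static-analysis-eval", split="train")
    solved = 0
    for i, sample in enumerate(ds):
        src_path = Path(work_dir) / f"{i}_{sample['file_name']}"
        src_path.parent.mkdir(parents=True, exist_ok=True)
        patched = call_llm(
            "Fix the security vulnerability in this file and return it in full:\n"
            + sample["source"]
        )
        src_path.write_text(patched)
        if not scan(str(src_path)):  # no remaining findings -> counted as fixed
            solved += 1
    return solved / len(ds)
```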