Compare commits
3 Commits
Author | SHA1 | Date | |
---|---|---|---|
![]() |
b35e50577f | ||
![]() |
d077827c11 | ||
![]() |
312229c8d4 |
24
.github/workflows/code_review.yml
vendored
Normal file
24
.github/workflows/code_review.yml
vendored
Normal file
@@ -0,0 +1,24 @@
|
|||||||
|
name: Code Review
|
||||||
|
|
||||||
|
permissions:
|
||||||
|
pull-requests: write
|
||||||
|
statuses: write
|
||||||
|
checks: write
|
||||||
|
contents: read
|
||||||
|
actions: read
|
||||||
|
|
||||||
|
on:
|
||||||
|
workflow_dispatch:
|
||||||
|
push:
|
||||||
|
branches: [ci-dev]
|
||||||
|
|
||||||
|
jobs:
|
||||||
|
code_review:
|
||||||
|
permissions:
|
||||||
|
contents: read
|
||||||
|
runs-on: ubuntu-latest
|
||||||
|
steps:
|
||||||
|
- uses: actions/checkout@v4
|
||||||
|
- uses: geoffsee/toakinize@v2.0.0
|
||||||
|
env:
|
||||||
|
OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
|
30
README.md
30
README.md
@@ -1,21 +1,25 @@
|
|||||||
# code-tokenizer
|
# toak
|
||||||
|
it's no joke
|
||||||
|
|
||||||
[](https://www.npmjs.com/package/toak)
|
[](https://www.npmjs.com/package/toak)
|
||||||

|

|
||||||
[](https://www.gnu.org/licenses/agpl-3.0.html)
|
[](https://www.gnu.org/licenses/agpl-3.0.html)
|
||||||
|
|
||||||
## Overview
|
## Overview
|
||||||
|
|
||||||
`toak` is a cli tool, named for phonetics, that processes git repository files, cleans code, redacts sensitive information, and generates a `prompt.md` with token counts using the Llama 3 tokenizer.
|
`toak` is an intentionally simple yet powerful tool that processes git repository files, cleans code, redacts sensitive information, and generates markdown documentation with token counts using the Llama 3 tokenizer.
|
||||||
|
|
||||||
```shell
|
```shell
|
||||||
$ cd your-git-repo
|
$ cd your-git-repo
|
||||||
$ npx toak
|
$ npx toak
|
||||||
```
|
```
|
||||||
|
|
||||||

|

|
||||||
|
|
||||||
|
## Philosophy
|
||||||
|
1. _Human-first_ technologies for a better future.
|
||||||
|
2. If you don't like the name...good.
|
||||||
|
---
|
||||||
|
|
||||||
## Features
|
## Features
|
||||||
|
|
||||||
@@ -25,10 +29,14 @@ $ npx toak
|
|||||||
- Redacts sensitive information (API keys, tokens, JWT, hashes)
|
- Redacts sensitive information (API keys, tokens, JWT, hashes)
|
||||||
- Counts tokens using llama3-tokenizer-js
|
- Counts tokens using llama3-tokenizer-js
|
||||||
- Supports nested .toak-ignore files
|
- Supports nested .toak-ignore files
|
||||||
|
|
||||||
|
### Token Cleaning
|
||||||
- Removes single-line and multi-line comments
|
- Removes single-line and multi-line comments
|
||||||
- Strips console.log statements
|
- Strips console.log statements
|
||||||
- Removes import statements
|
- Removes import statements
|
||||||
- Cleans up whitespace and empty lines
|
- Cleans up whitespace and empty lines
|
||||||
|
|
||||||
|
### Security Features
|
||||||
- Redacts API keys and secrets
|
- Redacts API keys and secrets
|
||||||
- Masks JWT tokens
|
- Masks JWT tokens
|
||||||
- Hides authorization tokens
|
- Hides authorization tokens
|
||||||
@@ -37,8 +45,16 @@ $ npx toak
|
|||||||
|
|
||||||
## Requirements
|
## Requirements
|
||||||
|
|
||||||
- npm/bun/yarn/pnpm
|
- Node.js (>=14.0.0)
|
||||||
|
- Git repository
|
||||||
|
- Bun runtime (for development)
|
||||||
|
|
||||||
|
## Installation
|
||||||
|
|
||||||
|
```bash
|
||||||
|
npm install toak
|
||||||
|
```
|
||||||
|
|
||||||
## Usage
|
## Usage
|
||||||
|
|
||||||
### CLI
|
### CLI
|
||||||
|
Reference in New Issue
Block a user