Skip to content

Commit 4f2f231

Browse files
committed
Tweak default built-in document categories
1 parent 809e37a commit 4f2f231

2 files changed

Lines changed: 115 additions & 60 deletions

File tree

.devcontainer/devcontainer.json

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@
1414
"editor.rulers": [
1515
98
1616
],
17+
"git.inputValidation": true,
1718
"terminal.integrated.defaultProfile.linux": "zsh"
1819
},
1920
"extensions": [

lib/archivist/system_calls.ex

Lines changed: 114 additions & 60 deletions
Original file line numberDiff line numberDiff line change
@@ -8,17 +8,26 @@ defmodule Archivist.SystemCalls do
88
@model "llama3.2"
99
@num_ctx 8192
1010

11-
@categories ~w[
12-
identity
13-
money
14-
medical
15-
insurance
16-
vehicles
17-
legal
18-
education
19-
manuals
20-
miscellaneous
21-
]a
11+
@categories [
12+
"Vital Records and Identification",
13+
"Financial Documents",
14+
"Tax Records",
15+
"Insurance Documents",
16+
"Medical and Health Records",
17+
"Property and Real Estate",
18+
"Housing and Household",
19+
"Vehicle and Transportation",
20+
"Employment and Career",
21+
"Legal and Estate Planning",
22+
"Education and Professional Development",
23+
"Family and Household Members",
24+
"Warranties and Manuals",
25+
"Memberships and Subscriptions",
26+
"Travel and Leisure",
27+
"Digital Assets and Online Accounts",
28+
"Sentimental and Historical",
29+
"Miscellaneous and Other"
30+
]
2231

2332
@slug_length 25
2433

@@ -27,57 +36,102 @@ defmodule Archivist.SystemCalls do
2736
extracted from a PDF, and your job is to return the following information in valid JSON format:
2837
2938
- category (string)
30-
- Must be exactly one of these: #{Enum.join(@categories, ", ")}.
39+
- Must be exactly one of these: #{@categories |> Enum.map(&~s/"#{&1}"/) |> Enum.join(", ")}.
3140
- This refers to the overall subject area or domain of the document.
3241
- Below are the category explanations for reference:
33-
34-
- identity
35-
- Personal identification and official records.
36-
- Examples: Passport, driver's license, social security card, birth certificate, visas,
37-
citizenship papers.
38-
39-
- money
40-
- Banking, taxes, financial records, digital receipts, and home-related financial
41-
matters.
42-
- Examples: Bank statements, credit card statements, IRS tax returns, W-2s, investment
43-
records, mortgage documents, lease agreements, property tax records, home-related
44-
bills (utilities, internet, etc.), software receipts.
45-
46-
- medical
47-
- Health records, insurance claims, and medical history.
48-
- Examples: Doctor visit summaries, prescriptions, lab results, vaccination records,
49-
dental & vision records.
50-
51-
- insurance
52-
- All types of insurance policies and claims.
53-
- Examples: Health insurance, auto insurance, home insurance, life insurance, policy
54-
renewal notices.
55-
56-
- vehicles
57-
- Documents related to vehicle ownership, maintenance, and insurance.
58-
- Examples: Car title, registration, loan documents, repair receipts, maintenance logs,
59-
DMV paperwork.
60-
61-
- legal
62-
- Legal documents, employment contracts, estate planning, and home-related legal
63-
records.
64-
- Examples: Wills, trusts, power of attorney, court records, notarized documents,
65-
employment contracts, business ownership documents, home repair contracts, renovation
66-
permits, legal agreements related to property.
67-
68-
- education
69-
- Academic records, certifications, and professional development.
70-
- Examples: Diplomas, transcripts, student loans, course certificates, professional
71-
training records.
72-
73-
- manuals
74-
- Instruction manuals, warranties, and documentation for products you own.
75-
- Examples: Appliance manuals, electronics guides, furniture assembly instructions,
76-
vehicle owner's manuals.
77-
78-
- miscellaneous
79-
- A catch-all for documents that don't fit any other category.
80-
- Examples: Unsorted files, temporary documents, one-off records.
42+
- Vital Records and Identification
43+
- Description: Documents that establish or verify an individual's identity and
44+
significant life events.
45+
- Examples: Birth certificates, marriage or divorce certificates, death certificates
46+
(for family members), passports, Social Security cards (or equivalents), citizenship
47+
or naturalization papers, name change documents.
48+
- Financial Documents
49+
- Description: Paperwork related to banking, credit, investments, and recurring
50+
expenses.
51+
- Examples: Bank statements, credit card statements, loan agreements (mortgage,
52+
student, car), investment records (stocks, bonds, mutual funds, cryptocurrency),
53+
budget worksheets, utility bills, subscription invoices.
54+
- Tax Records
55+
- Description: All documents needed for tax filing, verification, and historical
56+
reference.
57+
- Examples: Past tax returns, W-2/1099 forms (or international equivalents), receipts
58+
for deductible expenses (charitable donations, medical, business), property tax
59+
statements.
60+
- Insurance Documents
61+
- Description: Policies and claims information for various types of insurance.
62+
- Examples: Health insurance policy details, life insurance contracts, auto or
63+
homeowners policies, coverage schedules, claim forms, renewal notices.
64+
- Medical and Health Records
65+
- Description: Personal and family health documentation, including treatments and
66+
prescriptions.
67+
- Examples: Immunization records, physician or hospital visit summaries, lab test
68+
results, prescription information, dental/vision care records, documentation of
69+
chronic conditions.
70+
- Property and Real Estate
71+
- Description: Paperwork detailing real property ownership, transactions, and
72+
improvements.
73+
- Examples: Mortgage agreements, deeds and titles, closing documents, lease agreements
74+
for rental properties, receipts for major renovations, HOA (Homeowners Association)
75+
guidelines.
76+
- Housing and Household
77+
- Description: Day-to-day living documents and service agreements for your home.
78+
- Examples: Rental lease agreements (if renting), utility contracts and bills
79+
(electricity, water, internet), service or maintenance contracts (e.g., lawn care,
80+
pest control), appliance manuals, home repair receipts.
81+
- Vehicle and Transportation
82+
- Description: Records associated with car ownership, maintenance, and usage.
83+
- Examples: Vehicle titles, registration papers, auto insurance policies, maintenance
84+
and service records, warranty details, driver's license copies, parking permits.
85+
- Employment and Career
86+
- Description: Information related to current and past employment, as well as
87+
professional growth.
88+
- Examples: Employment contracts, offer letters, pay stubs, performance evaluations,
89+
benefits guides, separation or termination documents, professional certifications,
90+
résumés/CVs.
91+
- Legal and Estate Planning
92+
- Description: Legally binding papers covering estates, end-of-life directives, and
93+
other legal matters.
94+
- Examples: Wills, trusts, power of attorney documents, living wills or advance
95+
directives, guardianship papers, and court or legal settlement documents.
96+
- Education and Professional Development
97+
- Description: Records of academic achievements, certifications, and ongoing
98+
education.
99+
- Examples: Transcripts, diplomas, course certificates, scholarships or grant info,
100+
professional licenses, continuing education credits, conference attendance records.
101+
- Family and Household Members
102+
- Description: Personal documents specific to each household member or dependent.
103+
- Examples: Spouse or partner's documents (if kept separately), children's birth
104+
certificates, school records, immunization details, childcare arrangements, pet
105+
adoption or vaccination papers.
106+
- Warranties and Manuals
107+
- Description: Documentation for product guarantees and user guides.
108+
- Examples: Warranty information for electronics or appliances, user manuals, extended
109+
service contracts, purchase receipts for large items or equipment.
110+
- Memberships and Subscriptions
111+
- Description: Details on recurring membership-based services or organizations.
112+
- Examples: Gym memberships, club or association memberships, magazine or streaming
113+
subscriptions, loyalty or frequent flyer program statements, renewal notices.
114+
- Travel and Leisure
115+
- Description: Arrangements and records related to vacations, trips, and leisure
116+
activities.
117+
- Examples: Travel itineraries, flight tickets, hotel confirmations, visa
118+
documentation, travel insurance policies, timeshare contracts, past trip expense
119+
receipts.
120+
- Digital Assets and Online Accounts
121+
- Description: Information and credentials for online identities, cloud services, and
122+
digital platforms.
123+
- Examples: Password manager references (stored securely), domain registrations, cloud
124+
storage subscriptions, digital payment account details (PayPal, etc.), important
125+
email or social media account notes.
126+
- Sentimental and Historical
127+
- Description: Keepsakes and personal or family history items with emotional or
128+
genealogical importance.
129+
- Examples: Family photos, letters, journals, genealogy research, copies of heirlooms,
130+
scrapbooks, memorabilia.
131+
- Miscellaneous and Other
132+
- Description: A catch-all for documents that do not neatly fit into other categories.
133+
- Examples: Personal or hobby-related projects, unusual one-off contracts, event
134+
memorabilia, or temporary items awaiting proper classification.
81135
82136
- date (string)
83137
- Must be a valid ISO 8601 date in the format YYYY-MM-DD.

0 commit comments

Comments
 (0)