# XX — Other Scripts Archive

> ⚠️ **DEPRECATED / ARCHIVE** — These are historical script versions, preserved for reference. None are actively maintained. Use the current production tools instead.

> **Status:** Deprecated / Archive
> **Category:** Deprecated
> **Location:** `XX - Other Scripts/script junk unsorted/`

## Overview

The `XX - Other Scripts` directory is an archive of earlier script versions and experimental tools that predate the current production scripts. It contains multiple iterations of the VGMdb collection scraper and an unrelated web text extraction utility. All versions are superseded by the scripts documented under [VGMdb Scrapers](../vgmdb-scrapers/).

## Version History

The table below maps each archived folder to its approximate position in the development timeline, inferred from the feature additions listed in each folder's README.

| Folder | Tool Name | Key Features | Notes |
|--------|-----------|--------------|-------|
| `collectionscraper/` | VGMdb Collection Scraper | Login + scrape, Excel export, sortable/searchable table, clipboard copy, pagination (25/50/100/250/500/All) | Earlier version — no server-side session storage |
| `VgmdbScraper(2)/VgmdbScraper/` | VGMdb Collection Scraper | Same as above + **server-side session storage** (no 4KB cookie limit) | Adds session storage; no PostgreSQL |
| `VgmdbScraper(custom sorting)/VgmdbScraper/` | VGMdb Collection Scraper | Same as above + **PostgreSQL integration**, custom sorting (alphabetical or original order) | Adds persistent DB and sorting UI |
| `VgmdbScraper(custom sorting and no userid)/VgmdbScraperv4/` | VGMdb Collection Scraper | Same as the custom sorting version (the README is identical) | Folder name suggests a variant without the user ID field; inner folder is named `VgmdbScraperv4` |
| `Spreadsheet VGMDB/SpreadsheetVGMDB/` | VGMdb Scraper (URL-based) | Input multiple album URLs, extract Album ID / Title / Format / Category / Classification, clipboard copy for Excel, Docker support | **Different tool** — scrapes individual album pages by URL rather than a user's full collection |
| `[Scripts]/WebTextExtractor(1)/WebTextExtractor/` | Web Text Extractor | Streamlit UI, extract text from multiple URLs using CSS selectors (ID, class, tag, custom), CSV export | **Unrelated tool** — general-purpose web scraper using Streamlit; pre-configured for VGMdb album title selector |

## Tool Descriptions

### VGMdb Collection Scraper (multiple versions)

A Flask web application that logs into vgmdb.net with user credentials, scrapes the authenticated user's album collection, and presents the data in a sortable, searchable web table. All versions support Docker deployment and include Excel export and clipboard copy functionality.

**Feature progression:**

1. Base version (`collectionscraper/`) — core login + scrape workflow
2. Server-side sessions (`VgmdbScraper(2)/`) — removes the 4KB cookie size limit for large collections
3. PostgreSQL + custom sorting (`VgmdbScraper(custom sorting)/`) — adds persistent database and alphabetical/original-order sorting toggle
4. v4 variant (`VgmdbScraper(custom sorting and no userid)/`) — appears identical to step 3; folder name suggests a variant without the "scrape another user's ID" feature

**Current production equivalent:** See [VGMdb Scrapers](../vgmdb-scrapers/docker-vgmdb-scraper.md)
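
The step from cookie-backed to server-side sessions (step 2 above) can be illustrated with a small standard-library sketch. This is not the archived code: `ServerSideSessions`, `fits_in_cookie`, `COOKIE_LIMIT_BYTES`, and the sample collection are hypothetical names and data invented for illustration. The idea shown is that the browser cookie carries only a short random ID while the scraped collection stays in server memory.

```python
import json
import secrets

# The 4KB per-cookie limit mentioned above, expressed in bytes (assumed value).
COOKIE_LIMIT_BYTES = 4096


class ServerSideSessions:
    """Hypothetical in-memory store: the cookie holds only an opaque ID."""

    def __init__(self):
        self._store = {}

    def save(self, data):
        # Generate an unguessable ID and keep the payload on the server.
        session_id = secrets.token_urlsafe(16)
        self._store[session_id] = data
        return session_id

    def load(self, session_id):
        # Returns None for unknown or expired IDs.
        return self._store.get(session_id)


def fits_in_cookie(data):
    """Check whether the JSON-encoded payload stays within the cookie limit."""
    return len(json.dumps(data).encode("utf-8")) <= COOKIE_LIMIT_BYTES


# Made-up collection of 500 albums, far larger than one cookie can hold.
collection = [{"album_id": i, "title": f"Album {i}"} for i in range(500)]

sessions = ServerSideSessions()
session_id = sessions.save(collection)

print(fits_in_cookie(collection))  # the full collection does not fit
print(fits_in_cookie(session_id))  # the session ID does
```

A real deployment would typically back the store with files, Redis, or a database rather than a dictionary, so sessions survive a restart.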

### VGMdb Scraper — URL-based (`Spreadsheet VGMDB/`)

A separate Flask application (also Docker-compatible) that accepts individual album URLs as input rather than logging into an account. It extracts the Album ID, Title, Publish Format, Category, and Classification for each album. Results are displayed in a table and can be copied to the clipboard in an Excel-ready format. The tool waits 2 seconds between requests.
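
The workflow described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the archived implementation: the URL shape `vgmdb.net/album/<id>`, the function names, and the choice of tab-separated text as the "Excel-ready" format are all assumptions. The `fetch` callable stands in for whatever performs the actual HTTP request and HTML parsing.

```python
import re
import time

# Assumed URL shape for this sketch: https://vgmdb.net/album/<numeric id>
ALBUM_URL_RE = re.compile(r"vgmdb\.net/album/(\d+)")
REQUEST_DELAY_SECONDS = 2  # the delay between requests described above


def parse_album_id(url):
    """Return the numeric album ID from an album URL, or None if it does not match."""
    match = ALBUM_URL_RE.search(url)
    return int(match.group(1)) if match else None


def scrape_all(urls, fetch, sleep=time.sleep):
    """Call fetch(url) for each URL, pausing between consecutive requests."""
    rows = []
    for index, url in enumerate(urls):
        if index > 0:
            sleep(REQUEST_DELAY_SECONDS)
        rows.append(fetch(url))
    return rows


def to_clipboard_text(rows, columns):
    """Join rows as tab-separated lines, which Excel splits into cells on paste."""
    lines = ["\t".join(columns)]
    for row in rows:
        lines.append("\t".join(str(row.get(col, "")) for col in columns))
    return "\n".join(lines)
```

Passing `sleep` as a parameter keeps the delay testable without actually waiting; whether the original tool pauses between requests or after every request is not stated in the README summary.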

### Web Text Extractor (`[Scripts]/WebTextExtractor(1)/`)

A Streamlit application for extracting text from arbitrary web pages using configurable CSS selectors. It is not specific to VGMdb, but it was pre-configured with the VGMdb album title selector (`span.albumtitle[lang='en']`). It supports ID, class, tag, and custom CSS selectors, and exports results to CSV.
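
The four selector types and the CSV export can be pictured with a short standard-library sketch. The function names, the `url`/`text` column headers, and the mapping itself are assumptions made for illustration; the archived tool may build its selectors differently.

```python
import csv
import io


def build_selector(kind, value):
    """Turn a selector type and a user-supplied value into a CSS selector string."""
    if kind == "id":
        return "#" + value
    if kind == "class":
        return "." + value
    if kind in ("tag", "custom"):
        # Tag names and custom selectors are already valid CSS as typed.
        return value
    raise ValueError(f"unknown selector type: {kind}")


def results_to_csv(results):
    """Serialize (url, text) pairs to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["url", "text"])
    writer.writerows(results)
    return buffer.getvalue()


# The pre-configured VGMdb title selector passes through as a custom selector.
print(build_selector("custom", "span.albumtitle[lang='en']"))
print(build_selector("class", "albumtitle"))
```

Using the `csv` module rather than joining strings by hand means extracted text containing commas or quotes is escaped correctly.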

## Running These Archived Versions

These scripts are preserved for historical reference and are not recommended for use. If you need to run one, each folder contains its own `README.md` with Docker Compose instructions:

```bash
# Example — generic Docker pattern used by all collection scraper versions
cd "XX - Other Scripts/script junk unsorted/<folder-name>/"
docker-compose up -d
# Access at http://localhost:5000
```

## Notes

- The `[Scripts]` folder name uses square brackets, and several other folder names contain spaces or parentheses; quote these paths in shell commands.
- All collection scraper versions share the same security caveat: credentials are held in memory and not persisted to disk, but the tools should not be used in shared environments.
- The `Spreadsheet VGMDB` version is oriented toward building a spreadsheet from known album URLs, while the collection scraper versions are for bulk-exporting a user's entire owned collection.
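
As a small aid for the quoting note above, Python's standard `shlex.quote` shows which of the archived folder paths need quoting before being pasted into a shell command. The paths are taken from the version table and are relative to the archive location; the variable names are illustrative only.

```python
import shlex

# Folder paths copied from the version table above.
folders = [
    "collectionscraper/",
    "VgmdbScraper(custom sorting)/VgmdbScraper/",
    "[Scripts]/WebTextExtractor(1)/WebTextExtractor/",
]

# shlex.quote leaves shell-safe strings alone and single-quotes the rest.
quoted = [shlex.quote(folder) for folder in folders]
for path in quoted:
    print("cd", path)
```

Only the first path is shell-safe as written; the other two come back wrapped in single quotes.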

## Related Scripts

- [VGMdb Scrapers](../vgmdb-scrapers/docker-vgmdb-scraper.md) — current production collection scraper
- [VGMdb Reprint Finder (Deprecated)](vgmdbcrawl-reprintfinder-deprecated.md) — deprecated reprint crawlers
- [XX Backups and Outputs](xx-backups-outputs.md) — archive directories for output files and backups
