← Back to Skills

pyspark-schema-converter

v1.0.0 approved Data Processing ⬇ 4 ↑ 1/7d 3d ago
✅ Verified
⬇ Download Install Guide↓
🤖 Agent install commands (curl / MCP / Claude Desktop)
▸ curl one-liner
curl -L -o pyspark-schema-converter.skill   "https://aiskillstore.io/v1/agent/skills/6a19a067-93da-4b65-88f2-9e3f5d2461fb/download?platform=ClaudeCode"
▸ MCP tool call (after registering Skill Store MCP)
{
  "tool": "download_skill",
  "arguments": {
    "skill_id": "6a19a067-93da-4b65-88f2-9e3f5d2461fb",
    "platform": "ClaudeCode"
  }
}
▸ Claude Desktop / Cursor MCP config (one-time)
{
  "mcpServers": {
    "skill-store": {
      "url": "https://aiskillstore.io/mcp/"
    }
  }
}
📖 Full agent API guide: /llms.txt  ·  MCP server card

Convert JSON Schema / DDL / CSV headers to PySpark StructType code, and reverse-convert PySpark schemas to Avro or JSON Schema. No Spark runtime required.

# pyspark # schema # etl # ddl # avro # data-pipeline # spark

Basic Info

Owner 👤 aiskillstore-team Category Data Processing Registered 2026-06-12 Last Updated 2026-06-12 Latest Version 1.0.0 Packaged At 2026-06-12 Vetting Status approved Downloads 4 Checksum (SHA256) 33b361cc51acc77124b40a9d0872f78908fffc1fe74c0ed0490f7799da3f39a2

Installation

Compatible Platforms any

1
Install the skill using openclaw_skill_manager.py.
python openclaw_skill_manager.py --install pyspark-schema-converter
2
Verify installation
python openclaw_skill_manager.py --list-installed
3
Install a specific version (optional)
python openclaw_skill_manager.py --install pyspark-schema-converter --version 1.0.0
1
Download the skill package.
curl -O https://aiskillstore.io/v1/skills/6a19a067-93da-4b65-88f2-9e3f5d2461fb/download
2
Place it in the Claude Code commands directory.
unzip pyspark-schema-converter.skill -d ~/.claude/commands/pyspark-schema-converter/
3
Use it as a slash command in Claude Code.
/pyspark-schema-converter
1
Download the Agent Skills package.
curl -O https://aiskillstore.io/v1/agent/skills/6a19a067-93da-4b65-88f2-9e3f5d2461fb/download?platform=ClaudeCodeAgentSkill
2
Unzip it into the Claude Code skills directory.
unzip pyspark-schema-converter-agent-skill-*.skill -d ~/.claude/skills/pyspark-schema-converter/
3
Restart Claude Code — the skill is auto-loaded at session start. No slash command needed.
1
Download the Cursor-converted package.
curl -O https://aiskillstore.io/v1/agent/skills/6a19a067-93da-4b65-88f2-9e3f5d2461fb/download?platform=Cursor
2
Unzip and place it in a permanent location.
unzip pyspark-schema-converter-cursor-*.skill -d ~/.cursor/skills/pyspark-schema-converter/
3
Add the MCP server config to .cursor/mcp.json, then restart Cursor.
cat ~/.cursor/skills/pyspark-schema-converter/cursor_mcp_config.json
1
Download the Gemini CLI-converted package.
curl -O https://aiskillstore.io/v1/agent/skills/6a19a067-93da-4b65-88f2-9e3f5d2461fb/download?platform=GeminiCLI
2
Unzip and place it in a permanent location.
unzip pyspark-schema-converter-geminicli-*.skill -d ~/.gemini/skills/pyspark-schema-converter/
3
Add the MCP server config to ~/.gemini/settings.json, then restart Gemini CLI.
cat ~/.gemini/skills/pyspark-schema-converter/gemini_settings_snippet.json
1
Download the Codex CLI-converted package.
curl -O https://aiskillstore.io/v1/agent/skills/6a19a067-93da-4b65-88f2-9e3f5d2461fb/download?platform=CodexCLI
2
Unzip and place it in a permanent location.
unzip pyspark-schema-converter-codexcli-*.skill -d ~/.codex/skills/pyspark-schema-converter/
3
Add the MCP server config to ~/.codex/config.toml, then restart Codex CLI.
cat ~/.codex/skills/pyspark-schema-converter/codex_config_snippet.toml
1
Download the skill package via REST API.
GET https://aiskillstore.io/v1/skills/6a19a067-93da-4b65-88f2-9e3f5d2461fb/download
2
Place it in your agent platform's skills directory.
cp pyspark-schema-converter.skill ./skills/
3
Fetch platform-specific details via the Install Guide API.
GET https://aiskillstore.io/v1/skills/6a19a067-93da-4b65-88f2-9e3f5d2461fb/install-guide?platform=CustomAgent

Security Vetting Report

Vetting Result APPROVED

Findings: ["메타데이터 경고: 권장 필드 없음: 'requirements' (SKILL.md v2 권장)", "메타데이터 경고: 권장 필드 없음: 'changelog' (SKILL.md v2 권장)", '정보: spec: usk/1.0 미선언 — v2 패키지로 처리됩니다. 자동 변환 및 에이전트 검색 기능을 사용하려면 USK v3로 업그레이드하세요.']

✅ No security risks found.

AI Review Stage

Reviewer gemini Risk Level 🟢 Low Review Summary 선언된 제한적인 권한을 준수하며, 악의적인 동작이나 외부 통신이 없는 안전한 스킬입니다.
Reasoning

스킬 메타데이터에 명시된 `network: false`, `filesystem: false`, `subprocess: false` 권한 선언과 실제 코드의 동작이 완벽하게 일치합니다. 코드 분석 결과, `sys`, `json`, `os`, `re`, `csv`, `io` 등 표준 라이브러리만을 사용하며, 이들 또한 스킬의 핵심 기능(입력/출력 처리, 문자열 파싱, JSON 직렬화/역직렬화)을 수행하는 데 필요한 범위 내에서만 사용됩니다. - `os` 모듈은 `sys.path.insert`를 위한 경로 설정에만 사용되며, 파일 시스템에 대한 임의 접근은 없습니다. - `io.StringIO`는 메모리 내 문자열 처리에 사용될 뿐, 실제 파일 I/O를 수행하지 않습니다. - `subprocess` 모듈이나 외부 명령 실행 코드는 발견되지 않았습니다. - `requests`, `urllib` 등 네트워크 통신을 위한 라이브러리 사용이 전혀 없습니다. - 사용자 입력 스키마를 파싱하고 변환하는 것이 스킬의 목적이며, 이 과정에서 사용자 데이터를 무단으로 수집하거나 외부로 전송하는 행위는 발견되지 않았습니다. - 정적 분석 결과에서도 'red_flags_found', 'obfuscation_warnings', 'forbidden_exec_files_found' 항목이 모두 비어 있어 코드의 안전성을 뒷받침합니다. 전반적으로 코드 품질이 우수하고, 스킬의 목적에 부합하며, 보안 위험 요소가 없는 것으로 판단됩니다.

Version History

Version USK v3 Vetting Status Packaged At Downloads Changelog
v1.0.0 approved 2026-06-12 ⬇ 4

Examples 7

Representative input/output examples for this skill. Agents can use these to understand how to invoke the skill and what output to expect.

json_schema_to_pyspark
# json_schema# nested# array

Nested JSON Schema (user profile with address object and tags array) -> PySpark StructType Python code

📥 Input
{
  "input_format": "json_schema",
  "input_schema": "{\"type\":\"object\",\"required\":[\"user_id\",\"email\"],\"properties\":{\"user_id\":{\"type\":\"integer\",\"format\":\"int64\"},\"email\":{\"type\":\"string\"},\"address\":{\"type\":\"object\",\"properties\":{\"city\":{\"type\":\"string\"},\"zip\":{\"type\":\"string\"}}},\"tags\":{\"type\":\"array\",\"items\":{\"type\":\"string\"}}}}"
}
📤 Output
{
  "field_count": 4,
  "input_format_detected": "json_schema",
  "output": "PySpark StructType code with LongType for user_id, StringType for email, nested StructType for address, ArrayType(StringType()) for tags",
  "success": true,
  "warnings": []
}
mysql_ddl_to_pyspark
# ddl# mysql# decimal# bigint

MySQL CREATE TABLE with BIGINT NOT NULL and DECIMAL(10,2) -> accurate LongType and DecimalType

📥 Input
{
  "input_format": "ddl",
  "input_schema": "CREATE TABLE orders (id BIGINT NOT NULL, amount DECIMAL(10,2), status VARCHAR(50), created_at TIMESTAMP);"
}
📤 Output
{
  "field_count": 4,
  "input_format_detected": "ddl",
  "output": "PySpark StructType code: id as LongType(nullable=False), amount as DecimalType(10,2), status as StringType, created_at as TimestampType",
  "success": true,
  "warnings": []
}
csv_header_type_inference
# csv# type-inference# csv_header

CSV header with sample rows -> inferred PySpark types (IntegerType, DateType, BooleanType)

📥 Input
{
  "csv_sample_rows": [
    "1001,alice@example.com,2024-01-15,true",
    "1002,bob@example.com,2024-03-22,false",
    "1003,carol@example.com,2024-07-01,true"
  ],
  "input_format": "csv_header",
  "input_schema": "user_id,email,created_at,is_active"
}
📤 Output
{
  "field_count": 4,
  "input_format_detected": "csv_header",
  "output": "PySpark StructType code: user_id as IntegerType, email as StringType, created_at as DateType, is_active as BooleanType",
  "success": true,
  "warnings": []
}
pyspark_json_to_avro
# avro# reverse-conversion# pyspark_json

PySpark schema.json() format -> Apache Avro schema with logicalType annotations

📥 Input
{
  "input_format": "pyspark_json",
  "input_schema": "{\"type\":\"struct\",\"fields\":[{\"name\":\"order_id\",\"type\":\"long\",\"nullable\":false,\"metadata\":{}},{\"name\":\"total\",\"type\":{\"type\":\"decimal\",\"precision\":10,\"scale\":2},\"nullable\":true,\"metadata\":{}},{\"name\":\"status\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}}]}",
  "output_format": "avro"
}
📤 Output
{
  "field_count": 3,
  "input_format_detected": "pyspark_json",
  "output": "Avro record schema with long for order_id, decimal logicalType for total, string for status",
  "success": true,
  "warnings": []
}
nested_array_of_objects
# nested# array-of-objects# nullable# anyOf

JSON Schema with array of objects and anyOf nullable -> ArrayType(StructType) recursive conversion

📥 Input
{
  "input_format": "json_schema",
  "input_schema": "{\"type\":\"object\",\"properties\":{\"order_id\":{\"type\":\"integer\"},\"items\":{\"type\":\"array\",\"items\":{\"type\":\"object\",\"properties\":{\"sku\":{\"type\":\"string\"},\"qty\":{\"type\":\"integer\",\"format\":\"int32\"},\"price\":{\"type\":\"number\"}}}},\"tags\":{\"anyOf\":[{\"type\":\"array\",\"items\":{\"type\":\"string\"}},{\"type\":\"null\"}]}}}"
}
📤 Output
{
  "field_count": 3,
  "input_format_detected": "json_schema",
  "output": "PySpark StructType with LongType for order_id, ArrayType(StructType with IntegerType/DoubleType) for items, nullable ArrayType(StringType) for tags",
  "success": true,
  "warnings": []
}
invalid_ddl_error
# error-handling# ddl# validation

Malformed DDL input -> PARSE_ERROR with structured error response

📥 Input
{
  "input_format": "ddl",
  "input_schema": "THIS IS NOT VALID DDL AT ALL"
}
📤 Output
{
  "error": {
    "code": "PARSE_ERROR",
    "message": "PARSE_ERROR: Could not find column definition block \u0027(...)\u0027. Ensure input is a valid CREATE TABLE statement."
  },
  "success": false
}
ddl_to_json_schema
# json_schema# output-format# api-contract

PostgreSQL DDL -> JSON Schema output (for API contract generation)

📥 Input
{
  "input_format": "ddl",
  "input_schema": "CREATE TABLE products (sku VARCHAR(100) NOT NULL, price DECIMAL(12,4), stock INTEGER, last_updated TIMESTAMP);",
  "output_format": "json_schema"
}
📤 Output
{
  "field_count": 4,
  "input_format_detected": "ddl",
  "output": "JSON Schema with string for sku (required), number/decimal for price, integer for stock, string/date-time for last_updated",
  "success": true,
  "warnings": []
}

All examples are also available via the agent API: /v1/agent/skills/6a19a067-93da-4b65-88f2-9e3f5d2461fb/schema

Reviews & Ratings

No reviews yet. Be the first to leave one!

✍️ Write a Review