Question 1

What is the difference between character count and byte count?

Accepted Answer

Characters and bytes are not always the same. ASCII characters (A–Z, 0–9, basic punctuation) are 1 byte each in UTF-8. Most accented Latin characters (é, ü, ñ) are 2 bytes. Many emoji are 4 bytes. Chinese/Japanese/Korean characters are typically 3 bytes. If your system has a byte limit (database column size, API request limit, file size constraint), use the byte count — not the character count.

Question 2

Why does my string have more bytes than characters?

Accepted Answer

Because one or more characters in your text require multiple bytes to encode in UTF-8. This is normal for any text containing emoji, accented characters, or non-Latin scripts. The byte/character ratio column shows the average bytes per character — a ratio above 1.0 means multi-byte characters are present. The tool highlights this in amber as a heads-up.

Question 3

What does the JSON validation check?

Accepted Answer

It runs your text through JSON.parse() and reports whether it's valid JSON. If valid, it shows the number of top-level keys (for objects) or items (for arrays). It does not validate against a schema — just that the syntax is correct. Useful for verifying API responses, config files, or data exports before piping them into another tool.

Question 4

What does the CSV detection check?

Accepted Answer

It checks whether the input looks like CSV by confirming: every line has the same number of comma-separated fields, and there are at least 2 rows and 2 columns. If detected, it shows the row count and column count. It assumes comma delimiter — it does not currently detect tab-separated or semicolon-separated files. For a more robust CSV audit, use a dedicated CSV linter.

Question 5

What are non-printable characters and why do they matter?

Accepted Answer

Non-printable characters are Unicode code points that have no visible glyph: null bytes (\x00), carriage returns without matching newlines, zero-width spaces (U+200B), Unicode bidirectional override characters (U+200F, U+202E), and others. They're invisible in most text editors but break databases, APIs, and parsers. They can also be used to obfuscate malicious content in strings that appear safe. The non-printable count turns red when any are detected.

Character Counter (Dev)

How to use the dev character counter

How byte count is calculated

Why developers need character vs byte awareness

Frequently asked questions

Tools you might like