NodeJS
Enterprise Node by Khalil Stemmler
DTOs as Layer of Indirection
Different parts of software must be connected in order to do something useful, BUT too tight a connection will cause waterfall changes downstream when upstream introduces some change (tight coupling)
Data Transfer Objects are one way to lower coupling, by introducing mapping of DB/ORM models into our own DTOs, thus:
- introducing single point of change (by adding single mapper)
- hiding implementation details of DB
- dependency inversion
- allowing for changes more easily
- the easier it is to change a design decision the better
- creating a contract for API
You have domain objects (that your domain layer needs), data transfer objects (that your clients need), and the raw sequelize objects (that your database needs). The mapper handles transformations between all of ‘em.
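A minimal sketch of that mapper idea in TypeScript (the shapes and names here are illustrative, not the book's exact code): the raw DB row stays hidden behind `UserMap`, so a schema change touches one file.

```ts
interface UserPersistence { user_id: string; user_name: string } // raw DB/ORM row
interface UserDTO { id: string; username: string }               // what clients receive

class User { // domain object
  constructor(public readonly id: string, public readonly username: string) {}
}

class UserMap {
  static toDomain(raw: UserPersistence): User {
    return new User(raw.user_id, raw.user_name);
  }
  static toDTO(user: User): UserDTO {
    return { id: user.id, username: user.username };
  }
}
```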
Event Based Systems
Event Based Systems go hand-in-hand with DDD and are a great way of thinking about and dealing with complex problems, because with them it is:
- easy to express communication, ex: UserRegistered -> EmailVerificationSent -> EmailVerified
- possible to record system state as history of events
- possible to scale by making heavy operations async
- possible to reduce complexity via indirection
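A hedged sketch of that event chain, using Node's built-in EventEmitter as the simplest possible event bus (a real system would likely persist events or use a broker):

```ts
import { EventEmitter } from "events";

const bus = new EventEmitter();

bus.on("UserRegistered", ({ email }: { email: string }) => {
  // heavy work (sending mail) runs off the request path; on success, emit the next event
  bus.emit("EmailVerificationSent", { email });
});

bus.emit("UserRegistered", { email: "user@example.com" });
```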
Functional Error Handling
The happy path is not the only one, so we need a way to deal with errors
- errors represent state and need to be handled properly in proper place
- throwing errors is a mass disruption to the flow of code, that can be seen as a kind of `goto`
  - AND it is also just harder to reason about your program
Errors can and should be a part of the domain model, strictly defined on the interface, so it is easier to work with them
- this leads to the functional or golang "error as value" approach, where you have concepts like `Result<T>` OR, even better, `Either<E, T>`, so you can return a T value OR an E error
- it is pretty hard to work with types, when errors are "thrown" all over the place
- alternatives are:
  - return `null` - leads to impossibility of error handling, because there are no errors
  - throw `error` - harder to type + constant try/catch
  - return `Result` - the approach described here
- in the model above each error must have a strictly defined type, so it is identifiable in any part of the code it propagates to
- when working with Result you can create a combine function, that will calculate an overall Result from an array of Results (see the sketch below)
- in our model errors can be generic: Unauthorized, Unknown; or domain related: business (UserNameAlreadyTaken), general (UserUnknownError)
note that this is more of a solution for your custom app, otherwise you should:
- for libs - throw OR return the error as the first callback argument, due to it being the standard
  - btw, when working with an external lib in your app you can and should create adapters to match your style
- for broken state of an app - throw via invariant/assert to identify incorrect flow of code
- for unknown errors - throw, catch somewhere on top, log, return 500 (or similar action)
note that such approach is harder to adopt, but worth it at scale
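A minimal `Result` sketch under the assumptions above (a hand-rolled discriminated union, not any specific library's API), including the combine idea:

```ts
type Result<T, E> = { ok: true; value: T } | { ok: false; error: E };

// a strictly defined domain error, identifiable wherever it propagates
type UserNameAlreadyTaken = { type: "UserNameAlreadyTaken"; name: string };

function createUser(name: string, taken: Set<string>): Result<string, UserNameAlreadyTaken> {
  if (taken.has(name)) return { ok: false, error: { type: "UserNameAlreadyTaken", name } };
  return { ok: true, value: name };
}

// the "combine" idea: compute an overall Result from an array of Results
function combine<T, E>(results: Result<T, E>[]): Result<T[], E> {
  const values: T[] = [];
  for (const r of results) {
    if (!r.ok) return r; // first error short-circuits
    values.push(r.value);
  }
  return { ok: true, value: values };
}
```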
Use-case layer
We can keep application logic as layer of use-cases, thus keeping all logic encapsulated and organized
a use-case must:
- have an actor, who would use it
- be a command OR a query
- be related to a subdomain (a logical separation of the entire problem domain) (don't forget to distinguish between core and generic subdomains)
  - remember about Conway's law: Organizations that design systems are constrained to produce designs that are copies of the communication structures of these organizations.
  - it must be possible to extract a subdomain as micro-services
generally we will have such layers:
- domain to keep domain types and logic
- infra to keep networking, DB etc
- use-case layer to bind domain with infra, with additional app logic
notes:
- your project structure must give a basic, proper understanding of your system
- use-cases must be executable in any env, if all dependencies are provided, meaning that we could pass them to a server controller OR to a test
- use-cases can use other use-cases
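A rough sketch of a use-case as a class with injected dependencies (`CreateUserUseCase` and `UserRepo` are illustrative names, not from the source):

```ts
interface UserRepo {
  exists(email: string): Promise<boolean>;
  save(email: string): Promise<void>;
}

class CreateUserUseCase {
  // infra comes in as a dependency, so this runs in a server controller OR a test
  constructor(private readonly userRepo: UserRepo) {}

  async execute(request: { email: string }): Promise<"ok" | "EmailAlreadyTaken"> {
    if (await this.userRepo.exists(request.email)) return "EmailAlreadyTaken"; // error as value
    await this.userRepo.save(request.email);
    return "ok";
  }
}
```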
Clean Architecture
Our app must distinguish business logic (abstract, declarative) from infra logic (implementation focused, often imperative)
Business logic can't depend on infra
Define interfaces as strict contracts AND then define implementations of those interfaces for specific cases (ex: tests, server controller)
- by passing implementations as dependencies you enable testability and flexibility of your code (see the sketch below)
- it is easy to test domain, it is harder and slower to test infra, BUT infra is often external (ex: Redis), so there is not much need to do so
The only way we can be certain about changing code is to have written tests for it.
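A tiny illustration of the contract-plus-implementations idea (illustrative names):

```ts
interface Logger { info(msg: string): void } // strict contract

class ConsoleLogger implements Logger {      // infra implementation
  info(msg: string) { console.log(msg); }
}

class FakeLogger implements Logger {         // test implementation
  messages: string[] = [];
  info(msg: string) { this.messages.push(msg); }
}

// business logic depends on the abstraction only
function greet(logger: Logger, name: string) {
  logger.info(`hello ${name}`);
}
```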
Clean Controllers
We need to have some basic controller functionality, that all other controllers will inherit (see the sketch below), like:
- take req and send resp
- it also includes proper parsing AND stringifying
- return 200/201 with/without data
- return 400/similar with custom error
- return 500/similar with default error
- includes logging, ideally via a DI-passed logger
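A hedged sketch of such a base controller, assuming Express-style `req`/`res` objects (Express itself is an assumption here, not something the notes prescribe):

```ts
import type { Request, Response } from "express";

abstract class BaseController {
  // each concrete controller implements only this
  protected abstract executeImpl(req: Request, res: Response): Promise<void>;

  async execute(req: Request, res: Response): Promise<void> {
    try {
      await this.executeImpl(req, res);
    } catch {
      this.fail(res, "An unexpected error occurred"); // default 500
    }
  }

  protected ok<T>(res: Response, dto?: T) {
    if (dto) res.status(200).json(dto); // JSON stringifying handled here
    else res.sendStatus(200);
  }
  protected created(res: Response) { res.sendStatus(201); }
  protected clientError(res: Response, message: string) { res.status(400).json({ message }); }
  protected fail(res: Response, message: string) { res.status(500).json({ message }); }
}
```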
Understanding Core Concepts
Section will be about Node.js AND related BE topics
Intro
CLI
Command prompt - some info, that the terminal will show to you all the time
Command line - input field to execute commands
Terminal - app to access the CLI
Instruction - instruction to execute
- instructions are written in some scripting language
commands:
`cd dir` - go to directory
- cd == change dir
`mkdir name name2 ...` - create folder
- other commands also can chain inputs like so
`pwd` - print current full path
`touch name.ext` - create file
`echo $var` - print string/parameter to console
- `$var` is syntax to reference a variable
`ls filter` - list content of current dir
- `filter` is a glob pattern (optional)
- `-a` - show hidden files
`mv path1 path2` - move OR rename file/folder
`ssh` - connect to external server
`vim`/`nano` - terminal text editors
`cat file` - print content of a file
`rm path` - remove path
`man command` - command docs
`open path` - open path with Finder
shortcuts:
- ctrl+A - go to the beginning of a line
- ctrl+E - go to the end of a line
- arrow-[up/down] - move between command history
notes:
- `~` == home dir
- bash scripts are stored as `.sh`
- `..` - go up a dir
- `|` - pipe output of one command to another command
- `>` - extract output of a command
- commands can be run from inside, piped into and from a Node.js process
Node versioning
Always use LTS for production (a new LTS line is promoted every year)
To switch versions you can use `nvm`:
- `nvm ls` - list installed node versions
- `nvm (un)install <version>` - (un)install a version of node
- `nvm use <version>` - use a version of node
---
- npm -g will store packages scoped by version
- `.nvmrc` can be used to create dir-scoped configs of nvm WITH the possibility to always use a specific node version (by specifying it on the first line)
  - note: you still need to manually write `nvm use`
NodeJS Under the Hood
Programming is basically talking to the CPU, which requires Machine Code (a low-level set of instructions, unique per processor architecture), which can be generated from Assembly Language, which can be generated from a lower-level programming language like C/C++, that exposes low-level details to the coder, which in turn can be abstracted via a high-level language, that hides implementation details
- JS is an example of such a high-level language, while NodeJS is something that makes it possible to run JS on a device
- JS is first of all run by an engine, like V8 (embedded in NodeJS), SpiderMonkey etc AND only then embedded into other apps like NodeJS, Chrome etc, that expose additional functionality and syntax to it (ex: `window`, `global` etc), bound to JS
  - a proper engine must implement the ECMAScript specification, that states how JS must be run
The first major Node dependency is V8 to run JS, while the second is libuv, a library that exposes functionality to interact with the OS, in order for Node to be properly used as a server-side language
The other important Node concept is the Event Loop, that handles code execution and proper interaction between libuv and V8, in such a way that it enables us to write non-blocking async code
- non-blocking operations are achieved via the event-driven nature of Node, where you say to start doing something AND when this something is done, execute smth else OR send an event with the result of the operation
- related commands:
`process.nextTick(cb)` - execute callback after the current operation completes, before the event loop continues
- watch out for operations that block the main thread, such action will block both V8 and the Event Loop
  - basically this happens due to Node being single-threaded by its nature
    - this still implies the possibility of spinning up several instances of Node and communicating between them in multi-threaded manner
  - libuv has N (default = 4) threads to do sys-operations
  - Node aims to use as low a number of threads as possible
  - in general avoid: heavy operations done in one go (regex, file processing), heavy sync functions, complex operations
EventEmitter
Object that Node exposes to utilize event-driven communication via the Observer pattern
- EventEmitter is implemented in pure JS and can be reimplemented without Node
- events are used as a way to reduce overload, that otherwise would occur if we did pseudo-events via something like polling (sync event management)
ex:
```js
const EventEmitter = require("events");
const e = new EventEmitter();
const someCB = () => undefined;
e.on("eventName", () => undefined);
e.once("eventName", () => undefined); // run only once
e.off("eventName", someCB); // unsubscribe from event
e.emit("eventName");
```
notes:
- CBs are called in order of assignment
- event will trigger all CBs to be executed, even if several events are triggered one after another
- Node provides identical to browser API methods for completeness sake
Buffers
Binary data - data that consists of zeros and ones (aka base2 numbers), it is the only data format understood by computers
- 1 bit is single 0 or 1
- 1 byte is 8 bits
- conversion:
  - 01011 (base2) -> 1 * 2^0 + 1 * 2^1 + 0 * 2^2 + 1 * 2^3 + 0 * 2^4 -> 11 (base10)
- first digit of the sequence is the Least Significant Bit/Digit (LSB/LSD)
- last digit of the sequence is the Most Significant Bit/Digit (MSB/MSD)
Hexadecimal numbers - base16 numbers, that are also widely used in computing, due to ease of conversion between base2 and base16, which enables easier notation etc
- 4 bits can be represented as a single hex character, thus conversion can be easily done via a table
- hex is notated as 0x… OR #…
- conversion:
  - 0x456 -> 6 * 16^0 + 5 * 16^1 + 4 * 16^2 -> 1110 (base10)
- digits are represented as 0-F (case insensitive)
  - 0-F == 0-15
To work with characters a computer requires usage of a character encoding, which is basically a table that maps characters to some base2 numbers in a specific way
- ex:
  - ASCII // defines 128 characters of the English language and related special characters, 7 bits per char (usually stored as 8), subset of unicode
  - Unicode (standard, that defines encodings: UTF-8, UTF-16, etc) // defines all possible characters
    - UTF-8 stores all characters in 8n-bit sequences (8, 16, 24, 32)
- note that "9" isn't an actual 9, it will be encoded as some other number
- characters are transformed in the process of encoding (transforming readable data to some format); the backward one is called decoding
- encoding must always be specified
Buffer - container (allocated location) in memory, basically a data structure to work with memory, that is used to quickly write, hold, operate on and read data
- buffer is always prefilled with zeros
- Node buffers can be allocated and operated on in byte size only
  - basically you can imagine it as an array with 1 byte per element
- buffer can't be re-allocated, it is fixed size
  - overflowing data will be auto-discarded by Node
  - avoid allocating redundant memory
- buffer is limited by the amount of available RAM
- buffer can hold only positive values (0-255) per byte, BUT we can represent negative values via standards (ex: `(2's complement + 1) * -1`)
- methods:
  - `Buffer.from(arr)` - allocate a buffer of proper size and populate it with provided data
    - alternative is to provide `(string, encoding)` args to it
  - `buffer.fill(value)` - fill buffer with provided value
  ---
  - most methods can be re-implemented, BUT it will be less efficient due to internal optimizations
- notes:
  - default max buffer size is 4GB, BUT it can be changed
Notes:
- URLs will encode non-ASCII chars as hexadecimals in UTF-8
- Node pre-allocates memory region to place there your buffers
- Buffer is a subclass of `Uint8Array`
- Buffers support operations like `.indexOf` and similar iterative things, like arrays do
```js
const { Buffer } = require("buffer");

const size = 16;
const filler = 420; // fill values are truncated to the 0-255 range

/* allocation will always fill memory with `filler` OR `0`, so it can be slower */
const buf1 = Buffer.alloc(size, filler);

/*
allocate a free region of memory, BUT don't clean it
risks: sensitive data can become part of a buffer, memory content might be compromised
if you need additional speed with safety, always fill this buffer ASAP
*/
const buf2 = Buffer.allocUnsafe(size);

const buf3 = Buffer.from([/* ... */]);
const buf4 = Buffer.concat([buf1, buf2]); // concat takes an array of buffers

/*
the fastest way is to perform allocUnsafe on a memory size that is <= Buffer.poolSize >>> 1
(>>> == bit shift to the right, dropping the LSD)
>>> 1 is equal to division by 2 + Math.floor
*/

/* similar to unsafe, but doesn't utilize the pre-allocated pool, hence called slow */
const buf5 = Buffer.allocUnsafeSlow(size);
```
File System
When working with files in node you are actually working with Buffers, that represent content of file in binary format
File is basically a sequence of bits, that can be decoded in some specific manner(text, image etc)
- file has content AND metadata about this file
- Node is talking with OS via libuv that wraps SysCalls, that perform operations with files
- any file operation will use thread pool
Node has 3 implementations of the same `fs` API: sync, async via callbacks, async via Promises
- they have no difference underneath the hood, it is simply syntactic sugar
- Promises are the easiest to use
- callbacks are the fastest
- note: errors are always passed as the first value in a cb
- synchronous versions block the event loop, so use them only where blocking is acceptable
notes:
- `fs.watch` can fire events several times per one save due to OS, program and other issues, out of our control
- to properly execute read/write operations on a file you need to open it first (this assigns a constant id number, that will identify this file and allow to refer to it for doing operations) AND then execute the needed operation, using the returned handle
  - files must be closed to avoid memory leaks
  - this operation allows fine-grained access to a file (open for a long time, manage metadata, read in chunks, manage reading/writing streams etc), BUT it can be omitted, when you just need to quickly read something
  - reading will shift the position, so you might need to override default params
- Node can decode/encode only characters, not images, videos etc
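A small sketch of the two styles of reading described above, using the promise-based `fs` API (the file name is illustrative):

```ts
import { open, readFile } from "fs/promises";

async function main() {
  // quick one-shot read: no manual open/close needed
  const whole = await readFile("./notes.txt", "utf-8"); // encoding must be specified to get a string

  // fine-grained access: open returns a handle wrapping the numeric file descriptor
  const handle = await open("./notes.txt", "r");
  const { buffer, bytesRead } = await handle.read(Buffer.alloc(16), 0, 16, 0); // first 16 bytes
  await handle.close(); // always close to avoid leaks

  console.log(whole.length, bytesRead, buffer);
}

main();
```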
Streams
Basically a stream is a flow of some data, that is either written OR read
- data is added/received in chunks
  - in Node, the default chunk size is 16kb
- with streams you may sacrifice speed, by doing additional operations, BUT get a reduction in resource usage
  - BUT, if your data writes are smaller than 16kb, you will reduce the number of operations
- types:
  - writable - used to write to
    - has an internal Buffer to fill one chunk size, which will be emptied with a corresponding operation (ex: write chunk to file)
    - Node will auto-hold superfluous data in memory, if there is no place left in the Buffer, until the Buffer is emptied
    - stream has a `drain` event that signifies, that the Buffer is emptied and can be filled again, SO you can prevent additional memory consumption
    - avoid closing a stream before it finishes its execution, to avoid data loss
  - readable - used to read from
    - exposes a `data` event, that will be fired after the Buffer is filled and ready to be drained
    - readable stream is created in paused state; when an event handler is added we start to read AND, after there is nothing else to read, the stream is closed
      - closed state can be detected by the `"end"` event
  - duplex - writable + readable
    - has two separate Buffers to do reading and writing
  - transform - duplex, that can change data on the fly
API (for consumers)
- writable stream
  - to check the size of a writable stream you can use `.writableHighWaterMark` AND to check how much of the internal buffer is filled at the moment `.writableLength` exists
    - alternatively you can check the `boolean` returned by `.write`, that will tell whether you can write to the buffer OR the write will go to memory
      - generally avoid writing if `.write` returns `false`; if you got `false`, WAIT for the `"drain"` event
      - `"drain"` happens only if the full buffer was fully emptied
  - to do the last write and close the stream you can use `.end(data)`
    - next writes will always `throw`
    - `.end` will emit the `"finish"` event
  - if the underlying resource (ex: file) is closed (`.close()`), the stream will be closed and emit the `"close"` event
  - stream can be prevented from flushing data by `.cork()` and later re-enabled by `.uncork()`
  - `.destroy()` will remove the stream and the underlying buffer
  - you can change the default encoding of a stream via `.setDefaultEncoding()`
- readable stream
  - stream can be in two modes: stopped and resumed (`flowing`)
    - all streams are paused from the moment of creation, until a `"data"` event handler is added OR some other stream is piped via `.pipe()` OR `.resume()` is called
    - stream will be stopped, if `.pause()` is called OR all piped streams are removed via `.unpipe()`
    - can be tracked via `.readableFlowing`
  - events:
    - `"data"` / `"readable"` - receive data chunks
      - don't mix `"readable"` and `"data"` event usage
    - `"end"` - reading is finished
    - `"close"` - stream is closed
    - `"pause"` - stream is paused
    - `"error"` - error happened
  - `.pipe()` allows to pass data from a readable to a writable stream, with auto-handling of backpressure
    - accepts only a writable stream OR duplex OR transform, both of which implement the writable stream
    - pipe returns the stream, that was passed into it, so you can chain duplex streams, BUT not writable streams
    - avoid using pipe, when already subscribed to the `"data"` event of a readable stream
    - `.unpipe()` can be used to pause/stop reading
    - there are several events: `"pipe"`, `"unpipe"`
    - note, that `pipe` has bad error handling, so, for production, it is better to use `pipeline(source, ...transforms, destination, (err) => {})`, that will kill the underlying streams on error and pass this error to the cb
      - by default `.pipe` won't do clean-up, so you need to manually call `.destroy()` on each stream
  - `finished` - called when a stream errored OR is no longer writable/readable OR unexpectedly closed, so you can do clean-up
  - notes:
    - default buffer size for `fs` readable streams is 64kb
    - when doing something like read from one file and write to another file, watch for backpressure, aka the problem when you have higher input speed than output, THAT will ultimately cause memory issues
      - in `fs` this will happen due to hard-drive reading speed being much faster than write speed
    - the `cat` unix command is an analog of a readable stream, because it will partially read a file by chunks
    - you aren't guaranteed that data will be split in proper chunks, ex: the `"123 123 123"` string can be read as `"12" + "3 1" ...`
    - `fs.copyFile` will copy via streams
    - by default, files over 2GB can't be read directly without streams
API (for custom streams)
- generally custom streams are created by inheriting from a base stream class (`Writable`, `Readable`, `Duplex`, `Transform`), with the need to implement some predefined set of methods (see the sketch below)
  - avoid emitting events by hand OR overwriting existing methods, use predefined callbacks AND overwrite only internal, allowed methods
  - avoid throwing errors inside custom functions, pass errors to callbacks
  - JS doesn't have multiple inheritance (thank God), so Duplex is basically a built-in class to do multiple inheritance from `Writable` and `Readable`
- Duplex - stream that is both writable and readable, BUT writing and reading are done fully separately (separate Buffers, separate sources of reading and writing etc)
  - it is possible to make Duplex's read and write parts interact, BUT they don't have to
    - this will make the duplex a transform stream (a subtype of duplex)
    - transform inherits from duplex, BUT read and write methods are set in the manner described above
    - transform can be: pass-through (no data transformation occurs) or proper transform (data is changed in some manner)
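A minimal custom transform sketch (uppercasing text is an arbitrary illustrative transformation); errors go to the callback, never thrown:

```ts
import { Transform, TransformCallback } from "stream";

class UpperCase extends Transform {
  _transform(chunk: Buffer, _enc: BufferEncoding, cb: TransformCallback) {
    cb(null, chunk.toString().toUpperCase()); // pass the transformed chunk downstream
  }
}

// readable -> transform -> writable, with backpressure handled by pipe
process.stdin.pipe(new UpperCase()).pipe(process.stdout);
```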
Notes:
- streams can be used in object mode, where we are moving not buffers, but JS objects
  - `highWaterMark` will specify a number of objects
- streams can also be used on the FE in the browser, mainly to work with network streams
  - note that browsers are using streams under the hood (ex: video streaming), BUT this API allows to utilize it in your code
  - Node has an implementation of this API for compatibility, BUT it won't replace Node's streams API
Networking (lowest level possible in Node)
Node is designed and focused on scalable networking apps
Main concepts:
- in the pre-networking era information was passed via floppy discs
- write data to disk, physically move it to other PC and read from disk
- networking elements are:
  - ethernet cable to move data
  - switch to connect several PCs to it and form a small network
    - switch is receiving and sending packets of information from/to a PC
      - each packet contains addresses of source and destination
        - switch can look up addresses via an address table
  - networking card - part of the PC, that enables connection between ethernet and the PC
    - each card has a MAC Address associated with it, which is unique per device and is basically 48 bits, represented by 6 hex pairs, separated by `:`
- router - basically a higher order switch, that can communicate in between networks of switches via IP addresses (uniq address assigned to some device)
- router assigns IP addresses to elements inside its local network in a similarly unique manner, BUT these IP addresses aren't unique on a global scale
- networking is broken into separate, but connected layers, that act as abstractions built one upon another
  - Physical - level where info is moved in form of bits via cables
  - Data Link - level where info is moved in form of frames, with baked-in MAC addresses
    - done by switches
  - Network - level where info is moved in form of packets, with baked-in IP addresses
    - done by routers, which calc the shortest distance to another router
  - Transport - level where safety and losslessness of data is guaranteed
    - data is moved in segments
    - additional responsibility is connection establishment and proper disconnection (notify both sides that the connection closed)
    - port numbers are added on this level to connect to the application layer
    - TCP and UDP live here (there are others, but Node doesn't have native support for them)
- TCP - data is received in the order it was sent
  - 3-way handshake is the key algorithm -> A sends a connection packet, B responds with a connection acknowledgment packet, A sends data packets
- headers (it has no IP related headers, because they are added on previous layer) (non-fixed size)
- source port (16bits)
- destination port (16bits)
- sequence numbers (32bits) - keep data in order
- acknowledgment numbers (32bits)
- …other…
- data
- ex: ssh, http
- UDP - data is sent as fast as possible, with potential losses
- headers (fixed size of 8 bytes)
- source port (16bits)
- destination port (16bits)
- segment length (16bits)
- checksum (16bits) - to prevent data corruption
- data
- ex: http 3
- Application - abstract layer, where data is just data, that can be operated upon
- Node server
- Ports - a way to expose application to outside world (out of your PC)
- it is device based, but the standard recommends:
  - 0-1023 -> system ports, that run as sudo
  - 49152-65535 -> safe private ports
    - can be used for dynamic activity
- several apps can be created on the same port, BUT they must have different transports (TCP/UDP/etc)
- there is a list (IANA standard) of well-known ports, that should be used for some standard activities
  - ssh = 22
  - http = 80 - specifying this port will redirect all http traffic without a port to your app
  - https = 443
`net` - module to work with the network (on the lowest level possible in Node) and inter-process communication (IPC), both of which have a similar API
- it is the base for something like the `http` module
- creating a server
  - every app must have a port bound to it for proper routing
  - after the server is created it exposes a TCP connection on the specified port
  - in Node it is represented as a duplex stream
- connecting to a server - `net` allows to establish TCP connections with other servers via sockets, which are also duplex streams
  - connection will `throw`, if no server to connect to was found, due to how the TCP specification is made
  - connection will auto close, if the server stops working
  - client also needs to have a port, so it will be opened on the fly from the "dynamic" range, specified by the IANA spec
`readline` - module to read line by line from a readable stream
- can be greatly combined with `process.stdin`, which is a readable stream, that accepts input from the console
  - basically `.question` will do all the heavy lifting for you to achieve this
notes:
- all devices have a built-in loop-back address, which is universally standardized to be `127.0.0.1` (or the `localhost` DNS name)
  - this address will reroute requests back to the device (either by the device itself OR by the router)
- Node refers to opened connections as sockets, which, in the SE world, means two opened connections, that have a duplex style of communication
  - server can have multiple connections from clients, BUT each connection is represented through a separate socket object
- TCP packets are sent in order, BUT UDP ones aren't
- don't forget to add encryption to your server (via TLS directly, or by using HTTPS, FTPS etc)
- you can use the `pm2` CLI tool to run your Node process in the background and keep the shell interactive
- `ssh` is a tool to interact with a server from the CLI
- `scp` is a tool to handle file transfer with a server
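A compact sketch of a TCP echo server plus client with `net` (the port number is arbitrary); each connection is a duplex stream (socket):

```ts
import { createServer, createConnection } from "net";

const server = createServer((socket) => {
  socket.on("data", (chunk) => socket.write(chunk)); // echo back
});

server.listen(4000, () => {
  const client = createConnection({ port: 4000 }, () => client.write("hello"));
  client.on("data", (chunk) => {
    console.log(chunk.toString()); // "hello"
    client.end();
    server.close();
  });
});
```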
IP - Internet Protocol (third layer in the networking model)
IP Address - unique address associated with a device, that can be accessed through a network
- v4 - 32bits, with 8 bits (0-255) per portion (4 portions), separated with `.`
  - max number of unique devices that can be in a single network is 4 billion
  - IP comes with a subnet mask, which indicates what portion of the address is used for network and what for host (ex: 11111111.11111111.11111111.00000000, BUT in notation it will look like `some.ip.add.ress/24`, where 24 is the number of `1`s, in range 0-32)
    - this network portion is used to distinguish what traffic is routed to which network AND then by the network to decide what device to route it to
      - great for distinguishing requests to local and non-local networks
    - network portions are stored as tables inside routers
      - this led to standardization of these masks in such a way, that it is easy to route traffic across the globe
        - this results in IP addresses now containing your location in a non-explicit manner
      - this list is somewhat dynamic
  - there are some private addresses, that are used only for private networks and can't be accessed from outside of the network
    - reason for their existence is to save on public IP addresses (the 4 billion limit problem) and avoid assigning IP addresses to devices, that won't need them in the first place and can just be connected via a private network
    - to utilize them routers do NAT (Network Address Translation), that converts a private IP address to a public one
      - note that multiple privates can correspond to a single public AND conflict resolution is done via NAT
    - ex:
      - 10.x.x.x
      - 172.16.x.x
      - 192.168.x.x
      - 127.x.x.x - loopback
        - note that for IPv6 this is reduced to a single loopback address
  - some addresses are reserved for specific companies, like IANA
- v6 - while v4 is still widely used, we can't allocate new IPv4s, so we need a standard with a higher number of unique addresses
  - in general it is a faster and more improved version of v4 with higher capacity
  - addresses
    - 128 bits - 8 portions of 16 bits, separated by `:` and represented as 8 groups of 4 hex values
      - 2^128 devices
    - leading zeros can be discarded
      - ex:
        - `00AF` -> `AF`
        - `0000` -> `0`
        - `0000:0000` -> `::` // can be done only once per address
    - case insensitive (due to hex rules)
  - private addresses:
    - loopback: `0000:0000:0000:0000:0000:0000:0000:0001` OR `::1` OR some other shorthand
    - AND that's it, because we don't need to save on them anymore ;)
  - adoption is hard due to legacy AND worsened UX, when working with such "ugly" numbers
DNS (Domain Name System) - system which basically does conversion from human-readable strings to IP addresses
- all this DNS resolution is handled by DNS servers, that contain tables for conversion
  - note that partial tables can be cached on any edge (browser, device, router, ISP, country etc) of the network
- DNS servers are preconfigured on a device, BUT this list can be reconfigured
  - reconfiguration can lead to DNS hijacking, where a legit DNS is changed to route to a non-legit IP
- there is even a dns module, built into Node, that can do lookups and other related stuff
  - Node will auto-resolve DNS for you, if a domain name is passed instead of an IP
- built on top of TCP and can be built even via Node
- each PC has a private DNS table, for things like `localhost` + the caching, mentioned before
- great not only for UX, but to stabilize the network, because a single DNS name can correspond to a dynamic IP, or several IPs (great for keeping multiple servers on different edges of the network and serving the IP of the closest one)
UDP
- there is no guarantee of data ordering OR data completeness, when using UDP, BUT you gain speed
  - still, you can build something like HTTP 3, where you use UDP under the hood, BUT ensure data completeness via retries
- this means that you work in a connectionless model, meaning that you will just fire packets into the void and that's it
  - in Node you need to bind a port, so your app can listen for any incoming info
- Node allows to use UDP via `dgram`, which uses a similar socket interface as `net`, BUT you don't differentiate server from client here
  - Node has a default limit for the buffer size, that can be sent per single UDP request
  - there are UDP4 for IPv4 and UDP6 for IPv6
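A minimal `dgram` sketch of the connectionless model described above (the port is arbitrary):

```ts
import { createSocket } from "dgram";

const receiver = createSocket("udp4");
receiver.on("message", (msg, rinfo) => {
  console.log(`got "${msg}" from ${rinfo.address}:${rinfo.port}`);
  receiver.close();
});

// bind a port so the app can listen for incoming packets
receiver.bind(4001, () => {
  const sender = createSocket("udp4");
  // fire a packet into the void - no connection, no delivery guarantee
  sender.send("ping", 4001, "127.0.0.1", () => sender.close());
});
```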
HTTP
HTTP (Hyper Text Transfer Protocol) - a protocol (set of rules), that sits in the Application layer of networking and is responsible for handling the format of the data being sent; it allows to send different data (JSON, text, forms etc)
- HTTP is based on client/server model
- each request:
  - establishes a TCP/UDP connection
  - client sends a message, that is called a request (because only the client can request something)
    - firstly headers are sent
    - then the body is sent in chunks (the last chunk states, that it is the last chunk)
  - server responds with a message, that is called a response and it contains:
    - headers
    - body, that is sent in chunks, like the request is sent
  - the TCP connection is killed (`Connection: close`) OR kept (`Connection: keep-alive`) for following requests
    - for web development, keeping the connection is often the main choice
    - you can also configure timeouts per connection AND limit the max number of connections
    - network changes will cause re-establishing of the connection
- HTTP breaks your data into:
  - headers - some metadata of the request, it includes:
    - method - indicator of what action should be taken
    - url - what endpoint is called
    - code - indicator about how the server responded
    - other - how data is structured, other metadata about the request etc
  - body - actual data being sent
    - optional, but usually included
    - `req.body` comes as a stream, that needs to be properly handled
    - note that data can have a predefined length, defined by `content-length`, OTHERWISE `transfer-encoding: chunked` must be used
      - improper content length will auto-cut your data, BUT you don't need to explicitly `.end()` your stream
    - generally you will send JSON or plain text as body, but you can also send files in binary format OR multiple key-value (`string` OR `file`) pairs in `form-data` format
- codes (404 etc) are part of the protocol
  - HTTP2 omits statusMessages in favor of codes
- it is a first class citizen of Node, accessible via the `http` module
  - request and response in the plain `http` module are readable and writable streams
  - clients in Node are called agents, they correspond to TCP connections
    - data is sent through agents via request objects, that act as duplex streams
- there are 3 HTTP versions for now (1.1, 2, 3), BUT Node natively supports 1.1 and 2
  - a common strategy is to build a 1.1 server and then enable 2 and 3 by adding proxies, that convert from-to 1.1, to your server
- there is also HTTPS, but it is just an encrypted version of HTTP AND often also configured via proxies
- HTTP is a stateless protocol, meaning that each request doesn't know about the existence of another one AND no state is preserved
  - BUT you still can introduce state via headers, like Cookies
  - one purpose is scalability (the ability to proxy and distribute is possible only with a stateless protocol)
- under the hood HTTP formats the headline (method, url OR status code, status text), headers and body as a string in a specific format, that can be parsed to receive values
  - this makes HTTP requests easily readable, when decoded from bytes, BUT parsing them can be a bit problematic
  - also security is a nightmare, so use HTTPS ;)
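A bare-bones `http` server sketch showing the request-as-readable-stream / response-as-writable-stream idea (the port is arbitrary):

```ts
import { createServer } from "http";

const server = createServer((req, res) => {
  let body = "";
  req.on("data", (chunk) => (body += chunk)); // the body arrives in chunks
  req.on("end", () => {
    res.writeHead(200, { "Content-Type": "application/json" });
    res.end(JSON.stringify({ youSent: body })); // last write + close
  });
});

server.listen(8080);
```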
media types (MIME types) - a standard way to specify the type of sent data
- in the OS world a file extension is the analog of a MIME type
- the `Content-Type` header is used to send this info
  - it must be included for proper work of server/client
- it is possible to figure out the content type, but generally avoid this
  - it can be figured out by:
    - magic numbers - some file encodings have a number at the start to identify the format
    - file extensions
- structure: `type/subtype;key=value`
  - type - main type part, ex: `image`
    - there are two main classes: discrete (single file, ex: `image`, `text`) and multipart (multiple files, ex: `multipart`, `message`)
  - subtype - some sub-format of the type, ex: `png`
  - key=value - optional, additional info, ex: `charset=utf-8`
HTTP methods - data, included in requests, that is used to tell what action must be done by the server
- HTTP methods mainly differ in being idempotent or not
  - in this case idempotent means that multiple calls of an endpoint with this method can't change server state multiple times (only the first one), BUT responses can differ, if needed
- types:
- GET - retrieve some resource
  - request doesn't have a body
  - response has a body
  - idempotent
- POST - create some resource OR perform some action
  - request has a body
  - response has a body
  - not idempotent
- PUT - create (alternative to POST, when you need idempotent behavior) OR fully update a resource
  - request has a body
  - response has a body
  - idempotent
- PATCH - partially update a resource
  - request has a body
  - response has a body
  - not guaranteed to be idempotent
- DELETE - delete a resource
  - request may have a body
  - response may have a body
  - idempotent
- HEAD - retrieve headers
  - request doesn't have a body
  - response doesn't have a body
  - idempotent
- OPTIONS - retrieve possible communication options (what headers can be used, CORS)
  - request doesn't have a body
  - response may have a body
  - idempotent
- and other less important ones...
- notes:
- use different types as much as possible, this will act as docs + make app easier to integrate with
HTTP status codes - numbers (codes), that are used to communicate to the client the result of the executed operation (as a response)
- ranges:
- 100-599 - general
- 100-199 - info
- 200-299 - success
- 300-399 - redirect
- 400-499 - client errors
- 500-599 - server errors
- notes:
- generally try to be as descriptive as possible, BUT don’t give too much info to client for security reasons
notes:
- the main difference between a server and a web server is that a web server works with the web, meaning it serves HTML, CSS, JS over HTTP and utilizes Web APIs
- as an automatization matter you can auto-serve all files from your `public` folder
  - don't forget about proper MIME types
- cookies - a way to introduce state into the stateless HTTP communication between server and browser
  - cookies can be set by the server response header `Set-Cookie: key=val`, where each header can hold only a single cookie
  - client will always send all its cookies to the server in the form of a request `Cookie: key=val; key2=val2` header
    - be careful with performance degradation
      - this can be mitigated by using the `Path` property, that will scope cookie sending to some path only
  - ideally cookies shouldn't be changed by the client
    - this can be configured via the `httpOnly` property
  - don't forget to expire cookies via the `Expires` property
  - https can be enforced by the `Secure` property
- be careful when working with large bodies, avoid loading them into memory at once, do piping instead
  - generally pipe when content-length is bigger than `highWaterMark`
Unix
Unix - historically a machine-agnostic OS (written in C), that is the base for modern OSs
- it is required for BE, due to being the base for MacOS and Linux, the latter being the main server environment
- Linux is not written on top of Unix, BUT it uses the same philosophy (small and well done programs, that do one thing, embedded as modules and communicating via some standardized channel (ex: pipes)) and is Unix compatible
- Unix not only defines CLI utils, but the OS file structure, permissions etc
- programming languages can be Unix compatible too
Unix shells - some application (process), often written in C, that can communicate with the kernel via syscalls to do some OS-related work
- each shell has a specific instruction set, that is mapped onto syscalls
- shell is the base for any terminal (TTY)
- examples:
  - sh - first Unix-based shell
    - nowadays `sh` is often just an alias for `bash`
  - bash - common go-to shell, if you work with Linux
    - it is basically a programming language
  - zsh - bash-compatible shell, with additional features
    - MacOS default shell
bash command execution flow:
- look for a relevant alias (alias is an alternative way to name a command)
- look for existing functions (custom, then built-in)
- look at PATH
  - `$PATH` is a variable, that stores links to folders with executable applications, so any application stored there can be run from any part of the system by just running the command
  - `PATH` is passed from parent process to child as a copy, not by reference
file permissions - each file has 3 permissions (readable, writable, executable) in different combinations, that are assigned to 3 groups of users (owner, owner's group, other)
- can be changed by `chmod`
- note that executable files will run in the current shell, BUT running `bash ./script.sh` will spawn a child process with a shell
- a file can't be executable without being readable
to pass functions, variables and aliases from one script to another bash allows using the `source` (OR its alias `.`) command, that will execute a script and put everything into the parent shell
`child_process` - module that is used to manipulate processes in the OS
- `spawn("process_name", ["args"])` - spawns a child process with the provided args
  - stream based
  - will only look at PATH to find commands, so be careful with running built-in functions (though some of them can be listed in PATH) or aliases
- `exec("command")` - directly executes shell commands
  - basically it spawns a child shell process and executes commands in it
  - can't use streams
shell
- can be `login` and `non-login`, where the first can run additional commands/scripts before running
  - `login` shell is slower
- can be `interactive` and `non-interactive`, where the first can wait for user input in-between executing commands
- shell configs:
  - shell can run some default config files + a file that can contain custom configs
    - custom config often includes: aliases, functions, env vars
  - non-login + non-interactive - won't run config
  - login - runs default + custom config
  - non-login, interactive - runs custom config
processes - every running thing in unix is a process, that has some parent process, with the root being the kernel itself
- each process has: PID (process ID), PPID (parent PID)
- each process can start a child process via a syscall
  - this will pass the parent's envs to the child and establish a communication channel between them
  - note that killing the parent won't always kill the child, BUT such a child will have some other process managing it afterwards, otherwise it must be killed
- all processes use RAM
env var - some variable that is set on environment level and will be passed to child processes
- env var acts similarly to plain variable
- can be created by `export VAR_NAME` (should be uppercase by convention)
- key for deployment and hiding some sensitive info
file system - tree-like abstraction to manage data in Unix-compatible systems
- main dir is root (`/`)
  - `/` also acts as a separator of entries
- `$HOME` - env var that references the home dir path (`/Users/yourUser`) and is often aliased as `~`
- `.` == current dir
- `..` == previous directory (relatively)
- to make a path absolute it must start from `/` and it will always resolve to the specified place
- be careful with relative paths in node, they will be calculated from where you call the node process
  - to mitigate that you can use the `__dirname` OR `import.meta.dirname` variable, that will always resolve to the file's dir
- for better path management Node has the `path` package
  - it can work OS-agnostic
- `$PWD` - env var that stores the current working directory path
data streams - streams that are used to communicate between processes
- stdin (standard in) - stream that grabs input and directs it to a process
  - by default stdin is connected to the TTY, that proxies the keyboard
- stdout (standard out) - stream that grabs the output of a process and directs it onward
  - commonly it is directed from the process to the TTY, that proxies it to the monitor
- stderr (standard error) - same as stdout, but for data, that shouldn't be saved
---
- streams can be redirected from/to any process or even a file
  - this can be configured in:
    - Node, ex: a console object can be created with custom stdout and stderr streams
    - Bash, ex: specify 0<dest for stdin, 1>dest for stdout, 2>dest for stderr as a param when launching a process
      - `>dest` === `1>dest`
      - `<dest` === `0<dest`
  - note that all unix utils are configured to read from stdin and output to stdout
piping (`|`) - take stdout of process 1 and attach it to stdin of process 2
- ignores stderr
redirection (`>`) - redirect stdin, stdout, stderr from/to some destination
- you can redirect to void by specifying `/dev/null` as dest
- note that `>` will overwrite a file, but `>>` will append
an alternative way (to pipes) for inter-process communication in Unix is using IPC via Unix Domain Sockets
- it is done via the same `net` module and has a similar interface to TCP
  - instead of host+port we need to set `path` to a "file"
    - this "file" is the communication channel between two apps
    - note that this is not an actual file, but rather some unique id, like a port
      - in unix everything is a file, BUT it actually has the socket type
        - like a dir is a file, but with the dir type
- it can be easier to build more complex communication this way
- you still can use TCP+localhost to communicate between processes, BUT it is less efficient that way
- IPC can't be used between machines, only inside a single environment
  - IPC utilizes "shared" RAM, instead of the network card
  - note that Node doesn't support pure shared memory by default
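A minimal IPC sketch with `net` over a Unix domain socket (the socket path is illustrative):

```ts
import { createServer, createConnection } from "net";

const SOCKET_PATH = "/tmp/demo.sock"; // the "file" acting as the channel id

const server = createServer((conn) => conn.end("pong"));

// listen on a path instead of a port - same API otherwise
server.listen(SOCKET_PATH, () => {
  const client = createConnection(SOCKET_PATH);
  client.on("data", (chunk) => {
    console.log(chunk.toString()); // "pong"
    server.close();
  });
});
```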
clustering - running the same application multiple times on different physical cores
- by default, Node will run on a single core and can't be multithreaded
- clustering is done via the `fork` syscall, that basically clones the process as a child process
  - Node includes the standard `cluster` lib to do so
    - it is built upon `child_process.fork()`, which is a special case of `child_process.spawn()`
  - parent can `fork` itself many times, do coordination (often it is its only task) and pass data via pipes between processes
    - to utilize resources fully, parent can `fork` a child process onto the same core
  - run `fork` only from the parent to avoid infinite loops (actually Node will just fail to execute `fork` to prevent infinite loops)
    - can be checked via `.isPrimary`
  - you can call `fork` n times, where n is the number of CPU cores you have (`os.availableParallelism()`)
- note that Node.js allows to start multiple servers on the same port via clustering
  - Node will bind a single port to the parent process AND this parent will distribute calls to child processes via a round-robin algorithm
    - this is the primary usage of this module
  - it is recommended to do manual scheduling for heavy processes to avoid high loads
- to share data between parent and child you can use `worker.send(message)` and `process.on("message", (message) => undefined)` OR `process.send(message)` and `cluster.on("message", (worker, message) => undefined)`
  - `send` will do serialization for you
- other events for `cluster`: `"fork"`, `"listening"`
- for production you can use the `pm2` lib to do the same stuff
  - you can run your app in cluster mode this way, BUT you don't have any additional control
  - it also can be used to just run an application in the bg with additional monitoring
- avoid using cluster-only functions unconditionally
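A compact `cluster` sketch of the fork-per-core pattern described above (the port is arbitrary):

```ts
import cluster from "cluster";
import { createServer } from "http";
import { availableParallelism } from "os";

if (cluster.isPrimary) {
  // fork once per core; forking is only ever done from the parent
  for (let i = 0; i < availableParallelism(); i++) cluster.fork();
} else {
  // all workers share port 8080; the parent round-robins connections between them
  createServer((_req, res) => res.end(`handled by ${process.pid}`)).listen(8080);
}
```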
npm - package manager, that can install and manage code from external sources
ffmpeg - library to work with videos and images
- video - a sequence of bytes, that contains a large amount of images (frames), audio (can be multiple tracks) and some metadata
  - each media is encoded in a specific way, ex: images === H.264, audio === AAC, metadata === UTF-8
  - extensions just identify container types for all of this data and correspond to some supported codec types, that they can contain inside
  - each media is called a stream and can be changed independently from one another
notes:
- in bash, a process exits with some code, that signifies state (0 == executed successfully)
  - to read the previous command's exit code you can use `$?`
- bash allows to run processes in the bg by specifying `&` at the end of a command
- in Unix each process has a `nice` value bound to it to determine resource usage priority
  - ranges:
    - [-20, -1] - system processes
    - [0, 19] - other processes, where 0 is the default value
  - the lower the number - the higher the priority of the process
- great Unix tools
  - ffmpeg - multimedia processing tool
  - imagemagick - image processing tool
    - alternative is `npm/Jimp`
  - poppler - pdf processing tool
  - opencv - computer vision tool
  - whisper - speech to text
Compression
Compression - encoding data in such a way that the original content is partially/fully preserved, but byte size is reduced
- great to reduce the size of stored OR transferred information
- remember that you need to label compressed files with their compression type
- types:
  - lossless - no data is lost (main focus of this chapter)
    - in Node it is done via the `zlib` module, that is based on transform streams
    - types:
      - gzip - the most widely adopted algorithm on the web, has a great performance to compression percentage rate
        - optimized for text data
        - in Node: `zlib.createGzip() <=> zlib.createGunzip()`
      - brotli - often has a better compression percentage rate, compared to gzip, BUT consumes more CPU power
        - can be used for different data types
        - focused on optimization of web resources, like CSS, JS etc
        - in Node: `zlib.createBrotliCompress() <=> zlib.createBrotliDecompress()`
      - deflate - base for gzip (gzip adds some additional metadata)
        - in Node: `zlib.createDeflate() <=> zlib.createInflate()`
        - used in `.zip`
    - algorithms - a single compression type can combine several compression algorithms for a higher compression percentage (ex: Deflate itself is an algorithm, that combines other algorithms)
      - Huffman Coding - probability based, because the main idea is to find the most occurring thing and replace it with the least amount of bytes
        - great for text encoding, because we already have character distribution data for all languages
      - LZ - a family of algorithms, that are based on some principles
        - find repeating patterns, save the first one, reference the first one in other places
          - allows to configure `windowSize`, that identifies how far from each other repeating patterns can occur
    - compressed files won't reduce in size after a second compression
    - popular formats:
      - PNG - lossless images
      - FLAC - lossless audio
  - lossy - some non-significant amount of data is lost, so it can't be undone
    - doesn't make sense to perform on text, BUT great for multimedia, because the human eye can't detect some details, thus they can be neglected
    - isn't natively supported in Node
    - often compressed files won't reduce in size after a second compression
      - OR it is not worth it, due to low size reduction
    - popular formats:
      - JPEG - removes details from images, that the human eye can't see
        - mainly reduces coloring
      - MP3 - audio compression
      - AAC - audio compression, better quality than MP3
      - H.264 - compresses all multimedia inside AND looks for static parts in-between frames to compress them in the same way as LZ algorithms do, by referencing
- notes:
  - pre-compress static data AND do caching of compressed dynamic data, because compression is a resource-heavy operation
  - it is often worth it to compress text-based responses, BUT not worth it for small pieces of data
  - remember about the speed vs compression ratio trade-off
    - also time is not always linear
  - each algorithm can be configured to a different level of compression
  - additionally you can specify the file type, BUT often it can be autodetected
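A small sketch of stream-based gzip compression with `zlib` + `pipeline` (the file names are illustrative):

```ts
import { createReadStream, createWriteStream } from "fs";
import { createGzip } from "zlib";
import { pipeline } from "stream";

pipeline(
  createReadStream("./big.txt"),
  createGzip(), // a transform stream, so it slots straight into the pipeline
  createWriteStream("./big.txt.gz"), // .gz extension labels the compression type
  (err) => {
    if (err) console.error("compression failed", err);
  }
);
```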
http
- in the http world compression is done via including the `Accept-Encoding: ..., ...` header by the client to indicate supported compression formats
  - usually done automatically by the browser
    - same for decompression
- server can choose any compression AND, if any was chosen, it must respond with the `Content-Encoding: ...` header
  - note that `Content-Length` must be of the compressed data, but `Content-Type` of the original data AND compression is done only on the body, not headers
  - client can also compress the request body and include `Content-Encoding`, but it is not automatic AND generally avoided, because the client doesn't send too much data in general
- usually compression+decompression is done via a proxy OR middleware
minification - a "compression" technique for source code, where file size is reduced by introducing changes to the code, that don't change the logic of the program, BUT reduce file size
- remove whitespaces, remove comments, rename variables to a shorter format
- works not on binary data, BUT on the text itself
- great in combination with compression
- can be compared with resizing in the image optimization world
notes:
- don’t compress sensitive data, like jwt tokens etc, because it is possible to do breach attack, where third-party can decrypt encrypted content, if it was compressed
- avoid compressing small data AND already compressed data(ex: multimedia)
- be careful with concurrent compressions, because Node allocates separate threads to do compression operations
- gzip and brotli are doing checksum checks to preserve data integrity
Multi-Threading
While Node is called single-threaded, it still can offload heavy operations to separate threads
- for this reason Node has a threadpool, that is allocated to Node for any of its needs
- also Node allows to work with real kernel-level threads, do memory sharing and deadlock yourself ;)
- the reason for "single-threaded" is that Node, by default, utilizes the event loop to manage concurrent actions, while other languages can utilize separate threads
  - also JS itself is single-threaded AND multi-threading in Node is a fairly new concept, that can be done via worker-threads
  - an alternative way of handling this is spawning a child process, but spawning a process is a more resource-heavy operation (it still spawns a thread under the hood) than just spawning a thread
    - also threads can share memory without kernel intervention
Thread (same as process in Unix terminology) - unit of execution for a process (a single process can have one or more threads)
- thread can be in ready (waiting in queue to be executed by the CPU), running (being executed by the CPU) or sleep (waiting for something to happen) modes
  - there can be many more states, like: zombie state etc, BUT these are the main ones
  - the dispatcher moves threads from ready to running
    - thread is either finished OR moved back to the queue after a short time period
      - this creates an illusion of parallel execution (this illusion is called concurrent execution), even if the CPU has a single core
        - for multi-core CPUs, processes can be executed in a truly parallel manner
          - note that more physical cores aren't required, because one physical core can do two operations at a time; such sub-cores are called logical cores
      - thread is presented to the CPU as an instruction set, so it is easy to stop at some instruction AND continue later
- thread is moved to sleep state after doing blocking sys calls
  - after the call is done, the thread is moved back to the queue
  - ex: when a Node server process is listening on some port it is in sleep state, until some request comes in
- adding threads to a process allows to get more resources from the CPU
  - this will allow to boost performance by N times, if the operation can be done in chunks in parallel
- you can prioritize some threads over others
Node
- in Node, all thread operations are done via
worker_threads
module - Node allows to run files as separate threads, with separate event loop, context etc
- threads are created in async, gradual manner (still faster then creating a process)
- each thread occupies RAM
- be careful with creating threads on request, this can DDOS your service
- each thread occupies RAM
- thread communication is done via
MessageChannel
, that establishes communication channel between two ports - notes:
- child thread can’t make main thread exit
- child logs will go directly to parent thread
- based on
MessageChannel
- blocked main thread will lead to missing logs
- based on
- there are other slight differences from main thread
- threads often spawned in for loop
- worker can spawn other workers
- you can either spawn and terminate thread on each operation OR create a pool of threads, that will live in BG AND execute tasks on demand without additional overhead
- Node.js has built in thread pool, that, by default, consists of 4 threads, for different tasks
- only utilized by Node
- utilized by Node only when syscal can’t be done in async manner (ex:
fs
,crypto
,zlib
etc)- works only with async versions of functions
- be careful with creating too many Promises; Node will need to hold all of them in RAM, so better to constrain the max number
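As a rough illustration of the points above, a minimal `worker_threads` sketch (file names and the fib task are made up for the example; assumes CommonJS output, run the compiled JS):

```ts
// worker.ts - hypothetical CPU-heavy task moved off the main thread
import { parentPort, workerData } from 'node:worker_threads';

// naive fibonacci as a stand-in for any CPU-intensive operation
function fib(n: number): number {
  return n < 2 ? n : fib(n - 1) + fib(n - 2);
}

// reply to the parent over the built-in message channel
parentPort?.postMessage(fib(workerData.n));
```

```ts
// main.ts - spawn the worker and await its single result
import { Worker } from 'node:worker_threads';
import path from 'node:path';

function fibInWorker(n: number): Promise<number> {
  return new Promise((resolve, reject) => {
    // resolve relative to this file: Worker paths are otherwise relative to CWD
    const worker = new Worker(path.resolve(__dirname, 'worker.js'), { workerData: { n } });
    worker.on('message', resolve); // first (and only) message is the result
    worker.on('error', reject);
    worker.on('exit', (code) => {
      if (code !== 0) reject(new Error(`worker exited with code ${code}`));
    });
  });
}

// the main thread stays free to serve requests while fib(40) burns CPU elsewhere
fibInWorker(40).then((res) => console.log('fib(40) =', res));
```

For the pool variant, libraries like piscina keep a set of long-lived workers and queue tasks onto them, avoiding the per-task spawn overhead described above.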
CPU intensive vs I/O intensive vs Memory intensive operations
- CPU intensive - operation that takes large amount of CPU time
- I/O intensive - operation that heavily utilizes networking, file system etc to perform its work
- Memory intensive - operation that takes large amount of RAM
- Node itself handles optimization of I/O operations out of the box AND you won’t see much benefit from multi-threading them, WHILE CPU intensive operations can be greatly optimized
- only in Node world, in other languages you still might need to multi-thread your I/O operations
- note that for small operations it might not be the case AND batching can help to utilize Node’s potential
- memory-intensive operations might not benefit as much, so better to test first
shared memory
- by default, Node keeps the memory of each thread independent and shares values between them as deep copies, BUT shared memory is still possible
- thread concept helps here, because all threads share memory of single process, so no need to involve OS to do process communication and memory sharing
- Node only allows to share buffers, not actual objects
- note that you can use `Buffer` as an abstraction on top of `TypedArray`, which is an abstraction on top of the actual binary data, stored inside `.buffer` as `ArrayBuffer` or `SharedArrayBuffer`, and transferred this way
- for sharing you need to use `SharedArrayBuffer` only
- note that the `Buffer` abstraction can utilize a pre-allocated pool of memory to store its data AND not allocate actual memory on the fly
- `Buffer` is always based on `Uint8Array`, but you can also utilize other `TypedArray`s
- shared memory opens the door to race condition problems, which can be fixed with locks
- also remember about deadlocks
- race conditions - cases when several parallel operations try to modify a shared resource, which leads to non-deterministic behavior (the outcome depends on execution timing; the final result is determined by whichever operation runs last)
- places that cause race conditions are called critical sections
- main mistake is to split read and write into separate operations
- can happen in any concurrent environment, even in single-threaded cases
- to counter race conditions you can utilize the concept of atomic operations (an operation that must complete fully and can’t be interrupted)
- these are hardware-level operations, provided by the CPU, that can be accessed via syscalls (or wrappers, like `Atomics` in JS)
- hardware guarantees atomicity by locking the underlying values while the operation executes AND by never interrupting an atomic operation
- atomic operations are always simple ones; to implement more complex things you need to utilize the concept of mutual exclusion to enforce that some part of code OR a resource can be used by only n (often 1) concurrent functions at a given point in time
- mutual exclusion is built upon atomic operations
- the simplest form is a spinlock (busy-loop while another thread executes the critical section)
- in JS it can be implemented via `Atomics.compareExchange` (see the sketch after this list)
- more complex is semaphore, which can have N simultaneous processes running AND makes other waiting processes not just blocked, but sleeping
- a semaphore limited to one holder is called a binary semaphore AND if the locking party also does the unlocking you get a mutex
- be careful with locks, because locking+unlocking in the wrong order OR forgetting to unlock will cause a deadlock and make your application stuck
- watch for order when working with multiple mutexes OR use single mutex instead of combining two of them
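A minimal shared-memory-plus-atomics sketch (self-contained: the file re-runs itself as the workers; assumes CommonJS, so `__filename` is available; the counts are arbitrary):

```ts
import { Worker, isMainThread, workerData } from 'node:worker_threads';

if (isMainThread) {
  // one shared 32-bit cell, visible to all threads without copying
  const sab = new SharedArrayBuffer(4);
  const counter = new Int32Array(sab);

  const workers = Array.from({ length: 4 }, () => new Worker(__filename, { workerData: sab }));

  let alive = workers.length;
  for (const w of workers) {
    w.on('exit', () => {
      if (--alive === 0) {
        // always 400000 with Atomics.add; unpredictable with the racy variant below
        console.log('final:', Atomics.load(counter, 0));
      }
    });
  }
} else {
  const counter = new Int32Array(workerData as SharedArrayBuffer);
  for (let i = 0; i < 100_000; i++) {
    Atomics.add(counter, 0, 1); // atomic read-modify-write, no lost updates
    // counter[0] += 1;         // racy: read and write are separate steps (a critical section)
  }
}
```

A spinlock on top of this would loop on `Atomics.compareExchange(lock, 0, 0, 1)` until it wins the swap, and release with `Atomics.store(lock, 0, 0)`.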
event loop - Node.js achieves async behavior via the event loop concept and additional threads that live in the C++ world, while the main program, written in JS, is one single thread
- flow:
- execute sync code(includes subscribing to async events and placing tasks into microtask queue)
- empty out nextTick & microtask queues
- `Promise` results live in the microtask queue
- nextTicks are run in the same stage as microtasks, BUT always run before them
- each sub-queue will be executed in order of enqueuing
- nextTick & microtask queues will always be emptied after each CB in the CB queue is executed AND between each sub-CB queue
- setup event loop
- creates set of callback queues to be executed in specific order
- libuv pushes completed operations here
- the event loop keeps running as long as we have subscriptions AND CBs in callback queues
- each CB is executed one after another
- set of callbacks:
- timers (`setTimeout`, `setInterval`)
- poll (I/O operations)
- check (`setImmediate`)
- `setImmediate` CB will always execute right after the poll sub-queue is done, BUT it is not guaranteed to be executed next otherwise
- note that CB queues don’t guarantee ordering of enqueued operations
- notes:
- `async`/`await` is syntactic sugar over callbacks, thus `await` is not really a blocking operation for our thread; it only blocks the caller function’s flow of execution (see the sketch below)
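A tiny ordering demo of the stages above (a sketch; the relative order of `setTimeout(0)` vs `setImmediate` from the main script is not guaranteed):

```ts
setTimeout(() => console.log('timeout'), 0);   // timers queue
setImmediate(() => console.log('immediate'));  // check queue
Promise.resolve().then(() => console.log('microtask'));
process.nextTick(() => console.log('nextTick'));
console.log('sync');

// sync       - synchronous code always finishes first
// nextTick   - the nextTick queue drains before microtasks
// microtask  - then Promise callbacks
// timeout / immediate - then the timer and check phases
// (inside an I/O callback, setImmediate is guaranteed to fire before setTimeout(0))
```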
notes:
- always do monitoring and testing, because multi-threading is a mentally challenging and error-prone area of programming
- utilize threads to offload heavy OR background OR just blocking operations to separate thread and avoid slowing down or even blocking the main one
- each thread will have `Min((Cores / Blocking_Parallel_Threads) * 100%, 100%)` CPU usage
- note that 100% refers not to the whole CPU, but to a single core
- to calculate this value more precisely, you can use the `(total_time - total_idle_time) / total_time * 100%` (activity per core) formula
- to calculate total CPU utilization, just take the mean of all cores’ utilization
- if process has multiple threads, it will use more than 100%
- on Mac and some Unix-based machines; BUT it can also be represented as a % of total CPU usage
- not all cores can have same performance
- any operation, that can be ran in parallel, is great candidate to be ran in multi-threaded way
- always do testing with extreme inputs to detect potential problems
- you can profile whether your main thread is blocked via `performance.eventLoopUtilization()` (see the sketch below)
- multi-threading lightweight operations can degrade performance
- when working with a worker pool, be aware that some requests can be executed pretty fast, so they can run on the main thread and not be blocked by heavy tasks, OR they can get a separate queue, OR they can be batched
- using separate server for heavy tasks is also valid choice
- this is also relevant to topic of blocking event loop
- this can be used to DDOS your app
- it is often faster to use shared memory rather than message passing (excluding cases when there is too much overhead for locking etc)
- for CPU intensive operations it is almost always better to use C++, SO you can create a Node addon, written in C++, and delegate all expensive compute to it by embedding it inside your Node app (with no limitations, unlike WebAssembly)
- the C++ code will run as a thread under the Node process
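A small sketch of the event-loop-utilization profiling note above, using `perf_hooks` (the one-second interval is an arbitrary choice):

```ts
import { performance } from 'node:perf_hooks';

// eventLoopUtilization() is cumulative; passing a previous snapshot yields the delta
let last = performance.eventLoopUtilization();

setInterval(() => {
  const delta = performance.eventLoopUtilization(last);
  last = performance.eventLoopUtilization();
  // sustained values near 1.0 mean the main thread is busy or blocked by CPU work
  console.log('ELU over last second:', delta.utilization.toFixed(3));
}, 1000);
```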
Cryptography
The basic concept of cryptography is to use some key to modify info in such a way that it can’t be understood without the key, BUT can be reverted into its original form with the key
- key points
- confidentiality - data can be accessible only by holder of key
- integrity - content can’t be secretly modified
- authentication - identity of content’s author must be confirmable
- cryptography ensures security and privacy of info and communication
- for cryptography Node uses the `crypto`, `tls` and `https` modules
- all of them are built upon OpenSSL
- properly used modern encryption is practically unbreakable at the current moment
symmetric encryption
- flow:
- get original data, put it through Cipher(algorithm) with some private key and get encrypted data out
- get encrypted data, put it through the Cipher (algorithm) with the same private key and get the original data out
- common algorithms: AES (main), ChaCha (secondary), DES (deprecated main)
- variants
- One-Time Pad Encryption (OTP, often called perfect) - key is destroyed after usage
- requirements: key length equals data length, key is truly random, key is used once and destroyed
- based on the XOR operation, where the result is 0 for same bits and 1 for different ones, which is the `^` operator in Node
- this is more of a theoretical algorithm, because key management is problematic: the key must be securely passed from one place to another before each data exchange, AND the key is as large as the data
- note that key can be used partially, but you always need to know message length
- flow: generate a random key, exchange it between parties in a secure manner; each message uses part of the key via XOR (one party encrypts, the other decrypts), destroying that part afterwards
- this is mathematically impossible to break, because the attacker is effectively missing part of the data
- for small data, it might be faster to brute-force than something like AES, BUT it is much harder to find the original data, because every potential result looks equally plausible
- AES
- private key management is done via some other mechanism, primarily via asymmetric encryption
- AES has different modes
- ecb - concatenates blocks one after another with no randomness involved
- highly insecure, because repeated data will be easily detectable
- 1 to 1 in size
- fast and can run in parallel
- if we corrupt some block, only it will be corrupted in final data
- cbc - XORs each block with the result of the previous one (or random data for the first one) before encrypting, then concatenates the blocks
- 1 to 1 size + additional 128bits for random data
- the random data (the IV) must be truly random and never reused
- it can be sent as plaintext
- corruption of a block will corrupt it and its next neighbor
- sequential only
- ctr - generate a 128-bit number (counter) that is incremented by 1 (starting from 0) for each block, encrypted, and XORed with the block
- 1 to 1 size + additional 128bits for random data
- it can be sent as plaintext
- the attacker knows what data (the counter) is being encrypted, BUT they still don’t have the key, thus we are safe
- if we corrupt some block, only it will be corrupted in final data
- can run in parallel
- gcm (main) - ctr with an additional message (MAC) used to detect if the original cipher was modified (see the sketch after this list)
- it also allows to add additional verification data on top of key
- notes:
- key sizes can differ: 128, 192 or 256 bits
- key size defines the number of rounds: 10, 12, 14
- works on 128-bit blocks of info
- for each block: XOR with key, (do byte substitution, do byte shifting, do byte mixing, XOR with key) (repeated N times, without mixing at the last step)
- the key here is a 128-bit round key, derived from the original key through some math (key expansion)
- it is called a round key because it is used only once per round
- a block can be padded to match the block size (e.g. PKCS#7 padding bytes like `0x0f`)
- theoretically breakable AND might be brute-forced by quantum computers in the distant future
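A minimal AES-256-GCM sketch with Node’s `crypto` (the key would normally come from a KDF or KMS rather than be generated inline; GCM conventionally uses a 96-bit IV):

```ts
import { randomBytes, createCipheriv, createDecipheriv } from 'node:crypto';

const key = randomBytes(32); // 256-bit key
const iv = randomBytes(12);  // never reuse an IV with the same key

// encrypt
const cipher = createCipheriv('aes-256-gcm', key, iv);
const ciphertext = Buffer.concat([cipher.update('secret message', 'utf8'), cipher.final()]);
const tag = cipher.getAuthTag(); // the MAC that makes GCM authenticated

// decrypt - .final() throws if the ciphertext or tag was tampered with
const decipher = createDecipheriv('aes-256-gcm', key, iv);
decipher.setAuthTag(tag);
const plaintext = Buffer.concat([decipher.update(ciphertext), decipher.final()]);
console.log(plaintext.toString('utf8')); // "secret message"
```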
hashing
- flow:
- get original data of any size (message) -> pass through a hash function (one-way algorithm) -> receive fixed-size output (hash, often 256 bits)
- properties:
- process of hashing can’t be reversed
- hashes of similar messages can’t be similar
- in other words, it can’t be guessable
- number of hash collisions must be as low as possible (ideally 0)
- SHA-256 collision is still not found
- algorithms:
- MD5 (broken), SHA family (SHA-1 (broken), SHA-2 (SHA-256 is main), SHA-3), BLAKE2s256, BLAKE2b512 (see the sketch after this list)
- often the last number means the number of bits in the hash
- some use-cases:
- generate keys (with often strict length) from any data
- ensure integrity of data by hashing it and transmitting hash alongside to verify if data wasn’t modified
- password storage
- don’t forget to salt your passwords to defeat rainbow table attacks and reduce the possibility of brute-forcing passwords
- if a user has chosen a simple password, a large salt makes it harder to find
- still, too-short passwords are brute-forceable
- HashMaps
- they use a non-secure, but fast hashing function
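A short sketch of the hashing use-cases above (the salted variant is illustrative only; a real KDF from the sections below is the better tool for passwords):

```ts
import { createHash, randomBytes } from 'node:crypto';

// integrity: same input always yields the same 256-bit digest
const digest = createHash('sha256').update('some data').digest('hex');
console.log(digest.length); // 64 hex chars = 256 bits

// salted password hash: store { salt, hash }, never the password itself
const salt = randomBytes(16);
const hash = createHash('sha256').update(salt).update('user password').digest('hex');
// the salt makes precomputed rainbow tables useless
```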
message authentication codes - to prevent man-in-the-middle attacks, MACs can be used to verify the integrity of data transmitted over some channel
- flow: get the ciphertext, combine it with the encryption key and hash them together; the result should be sent as the second part of the message, after the ciphertext
- algorithms: hmac (main)
- Node has a built-in family of MAC algorithms inside `crypto` (see the sketch after this list)
- it is better to use the built-in algorithms, because just appending the key to the message is vulnerable with some algorithms, like the SHA-n sub-family
- it opens the possibility to append to the original data and still have matching MACs
- it is called length extension attack
- algorithms like HMAC split the key into two parts and pad the message from both sides
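A minimal HMAC sketch via `crypto` (the message and key are placeholders):

```ts
import { createHmac, randomBytes, timingSafeEqual } from 'node:crypto';

const macKey = randomBytes(32);
const message = Buffer.from('ciphertext bytes go here');

// sender: append the MAC to the transmitted message
const mac = createHmac('sha256', macKey).update(message).digest();

// receiver: recompute and compare in constant time (avoids timing attacks)
const expected = createHmac('sha256', macKey).update(message).digest();
console.log(timingSafeEqual(mac, expected)); // true unless the message was tampered with
```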
key derivation functions - functions that help create consistent random-looking keys from input that often comes as a human-readable password
- flow: combine password with salt and do hashing N times to derive final result
- algorithms: PBKDF2 (main), Scrypt (good for simple passwords), HKDF, Argon2 (see the sketch after this list)
- notes:
- plaintext passwords must not be stored; store only the derived key and the corresponding salt (16+ bytes)
- for keys longer than the hashing algorithm can output, we need to derive each chunk of data separately
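A sketch of the flow above with Node’s built-in scrypt (the password is a placeholder; cost parameters are Node defaults):

```ts
import { randomBytes, scryptSync, timingSafeEqual } from 'node:crypto';

// registration: derive a 32-byte key; store ONLY { salt, derived }
const salt = randomBytes(16);
const derived = scryptSync('correct horse battery staple', salt, 32);

// login: re-derive with the stored salt and compare in constant time
const attempt = scryptSync('correct horse battery staple', salt, 32);
console.log(timingSafeEqual(derived, attempt)); // true for the right password
```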
asymmetric (public-key) cryptography - process of data encryption and decryption where a pair of keys is used, one to encrypt and the other to decrypt
- keys are related via math, but not identical
- flow: generate a key pair, send the public key to the other party, receive encrypted info from the second party, decrypt via the private key (see the sketch after this list)
- most of the time, asymmetric cryptography is used to exchange keys for doing symmetric cryptography, because the latter is much faster
- algorithms: RSA, DH (Diffie-Hellman), ECDH (Diffie-Hellman with Elliptic curves)
- RSA solves both the symmetric key exchange problem AND authentication (that public keys belong to the right person), BUT ECDH can be faster, so we can combine them
- RSA algorithm:
- generate 2 random primes, multiply them to get the modulus, calculate Phi of the modulus, pick the public exponent (often 65537, for speed and resource preservation), calculate the private exponent from all previous data
- we need to keep the modulus and public exponent in public space AND the primes and private exponent in private space
- to break RSA you need to take the modulus and find the two primes used to calculate it, which is computationally infeasible given large enough primes
- RSA adds an IV analog by default as part of the input padding process
- RSA has input size limitation
- generate 2 random primes, get modulus of both keys, calculate Phi of both keys, calculate public exponent (often 65537 used for speed and resource preservation), calculate private exponent from all previous data
- note:
- both keys are basically ciphertexts with some information put inside of them
- it can be viewed via tools like openssl
- both keys contain large numbers that are mathematically co-linked between the keys
- both keys can be used to reverse each others operation, they aren’t limited to just one action
- the key pair must be generated by the party that receives the data
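A minimal RSA sketch, wrapping a symmetric key (its typical job, per the note above; the 2048-bit modulus is an assumption for the example):

```ts
import { generateKeyPairSync, publicEncrypt, privateDecrypt, randomBytes } from 'node:crypto';

// the receiving party generates the pair and publishes only the public key
const { publicKey, privateKey } = generateKeyPairSync('rsa', { modulusLength: 2048 });

// encrypt a small symmetric key, not bulk data (RSA input size is limited)
const aesKey = randomBytes(32);
const wrapped = publicEncrypt(publicKey, aesKey);   // OAEP padding by default in Node
const unwrapped = privateDecrypt(privateKey, wrapped);
console.log(unwrapped.equals(aesKey)); // true
```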
digital signatures - process of signing (attaching some data) OR verifying (checking that an existing signature is valid) for some data
- similar to signatures in real world, BUT with math ;)
- flow: sign the hash of data with your private key, send your public key with the signed hash (often appended to the actual data); everyone can verify that the data belongs to you via your public key (see the sketch after this list)
- the main problem is that you need to involve some third party to verify that a particular person owns a particular public key
- basically similar to how government enforces ownership of signatures
- the same concept is used in HTTPS, where you also have certificates, with certificate authorities that guarantee the validity of public keys corresponding to domain names
- `X.509` is the commonly used certificate format
- it is a `pem` file with some data: issuer, subject (server identity), public key, signature algorithm, issuer signature
- note that commonly used certificates come preinstalled on the device, to prevent man-in-the-middle attacks during the initial exchange with the issuer
- these are called root certificates, meaning certificates that were issued and signed by the same subject (issuer == subject)
- to check this list you can use `tls.rootCertificates`
- TLS is an encrypted layer on top of TCP, which is used to make HTTP secure
- certificates can be chained (often from 2 to 4)
- authority verifies you by domain name via some DNS record
- this means that almost no authority legally verifies the business or forces it to comply with any rules, so HTTPS only guarantees a secure connection to the server itself
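A short signing sketch with Node’s `crypto` (Ed25519 chosen for brevity; the data is a placeholder):

```ts
import { generateKeyPairSync, sign, verify } from 'node:crypto';

const { publicKey, privateKey } = generateKeyPairSync('ed25519');
const data = Buffer.from('contents to be signed');

// sign with the private key; anyone holding the public key can verify
const signature = sign(null, data, privateKey); // Ed25519 hashes internally, hence null
const ok = verify(null, data, publicKey, signature);
console.log(ok); // true - becomes false if data or signature is altered
```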
dictionary:
- plaintext - original data
- ciphertext - encrypted data
- encryption - conversion of plaintext to ciphertext
- decryption - conversion of ciphertext to plaintext
- cipher - algorithm for encryption and decryption
notes:
- encryption universally works on binary data
- http is transferred as plain text, thus you need https to prevent reading of transferred packets
- encryption is lossless
- JS’s `Math.random` is not secure and can’t be used for cryptography
- encryption must produce the same data for the same key and input
- BUT the result of encryption can’t be guessable based on similar inputs
- a system is only as secure as its weakest part
- a good algorithm is not only secure; it must also be efficient, easy to understand, amenable to hardware acceleration and reasonable to use
- never put secrets directly into code, BUT also don’t overuse env vars for this purpose; it is much better to use a dedicated key management service (KMS)