PHP Internal Architecture
Pluggable, Extendable, Useable
Architecture
PHP piece by piece
You should know the basics
All the puzzle pieces
PHP
Input/Output
•SAPI
•Streams
Engine
•Lexer
•Parser
•AST
•Compiler
•Executor
Extensions
•Zend Extensions
•Compiled In
•Loaded at startup
•Loaded at runtime
Running PHP
server makes
request
SAPI talks to
engine
engine runs
SAPI returns
output to
server
How other languages do this
Python (CPython)
• mod_python (embedded Python
interpreter, deprecated)
• mod_wsgi (embedded or daemon):
basically a mod_python copy, OR
talking over unix sockets to a Python
interpreter with a special library
installed
• command line interpreter
• FastCGI/CGI (using a library in Python)
Ruby (MRI)
• also known as “CRuby”
• Matz’s Ruby Interpreter
• use Rack (library) to:
• write/run a ruby webserver
• use another server in between with
hooks to nginx/apache (unicorn,
passenger)
• use FastCGI/CGI
And still more..
NodeJS
• Your app is your server
• This is a pain
• Write your own clustering or other
neat features!!
• So you stick a process manager in
front
• And you reverse proxy from
apache/nginx
• Or you use passenger or some other
server….
Perl
• Yes it still exists – shhh you in the
back
• PSGI + plack
• mod_perl
• mod_psgi
What makes PHP different?
• Shared nothing architecture by design
• application lifecycle is per-request
• no shared state natively
• infinite horizontal scalability in the language itself
• HTTP is a first class citizen
• You don’t need a library or framework
• SAPI is a first class citizen
• Designed to have a server in front of it
• No library necessary
• You don’t need a deployment tool to keep it all going
The answer to your question is
SAPI
Server API – the least understood feature in PHP
What is a SAPI?
• Tells a Server how to talk to PHP via an API
• Server API
• Server Application Programming Interface
• “Server” is a bit broad as it means any type of Input/Output mechanism
• SAPIs handle:
• input arguments
• output, flushing, file descriptors, interruptions, system user info
• input filtering and optionally headers, POST data, HTTP specific stuff
• Handling a stream for the request body
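A running script can ask which SAPI is serving it. The sketch below is illustrative only: `describe_sapi()` is a hypothetical helper, and the lookup table is an assumed, incomplete list of SAPI names. `php_sapi_name()` and the `PHP_SAPI` constant are the real built-ins.

```php
<?php
// Hypothetical helper: report which SAPI is serving the current script.
function describe_sapi(): string
{
    $name = php_sapi_name(); // same value as the PHP_SAPI constant

    // Assumed, incomplete lookup table of SAPI names.
    $known = [
        'cli'            => 'command line',
        'cli-server'     => 'built-in development web server',
        'fpm-fcgi'       => 'php-fpm (FastCGI Process Manager)',
        'cgi-fcgi'       => 'CGI / FastCGI binary',
        'apache2handler' => 'mod_php inside Apache',
        'embed'          => 'php-embed',
        'phpdbg'         => 'phpdbg debugger',
        'litespeed'      => 'LiteSpeed',
    ];

    return $name . ': ' . ($known[$name] ?? 'another SAPI');
}

echo describe_sapi(), PHP_EOL;
```

The same script reports a different name depending on how it is started, which is the point of the slide: the SAPI, not the script, decides where input comes from and where output goes.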
In the beginning
• CGI
• Common gateway interface
• Shim between web server and
program
• Simple
• Stateless
• Slow
• Local
• Good security with linux tools
• Slow
• Local
• Programs can have too much access
• Memory use not transparent (thrash
and die!)
Then there was PHP in a Webserver
• mod_php (apache2handler)
• Run the language directly in the
webserver, speaking to a
webserver’s module api
• Can access all of apache’s stuff
• Webserver handles all the request
stuff, no additional
sockets/processes
• It works well
• Requires prefork MPM or thread safe
PHP
• Eats all your memories and never lets
the system have it back
• Makes apache children take more
memory
CGI is slow: FastCGI to the rescue!
• Persistent processes but CGI mad style
• Biggest drawbacks?
• “it’s old”
• “I don’t like the protocol”
• “it’s not maintained”
• “other people say it’s not stable”
• Apache fcgi modules do kind of suck 
• Nginx “just works”
• IIS8+ “just works”
php-fpm – Make FastCGI better
• FastCGI Process Manager
• Adds more features than traditional FastCGI
• Better process management including graceful stop/start
• Uid/gid/chroot/environment/port/ini configuration per worker
• Better logging
• Emergency restart
• Accelerated upload support
• Dynamic/static child spawning
CLI?
• Yes, in PHP the CLI is a SAPI
• (Did you know there’s a special Windows CLI that doesn’t pop a console window?)
• PHP “overloads” the CLI to have a command line webserver for easier
development (even though it SHOULD be on its own)
• PHP did that because fighting with distros to always include the cli-server would have
meant pain, and if you just grab php.exe the dev webserver is always available
• The CLI treats console STDIN/STDOUT as its request/response
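As a rough sketch of "STDIN/STDOUT as request/response", the hypothetical `handle()` function below reads a "request" from one stream and writes a "response" to another. The function name and the upper-casing behaviour are assumptions for illustration; under the CLI SAPI the two streams would be the predefined `STDIN` and `STDOUT` constants.

```php
<?php
// Hypothetical handler: copy the "request" stream to the "response" stream,
// upper-casing each line. Returns the number of bytes written.
function handle($in, $out): int
{
    $bytes = 0;
    while (($line = fgets($in)) !== false) {
        $bytes += fwrite($out, strtoupper($line));
    }
    return $bytes;
}

// Under the CLI SAPI this could be wired up as:
//   handle(STDIN, STDOUT);
// and run with something like: echo "hello" | php script.php
```

Because the function only sees two stream resources, the same code could be exercised with in-memory streams, which is how it can be tested without a terminal.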
php-embed
• A thin wrapper allowing PHP to be easily embedded via C
• Used for extensions in node, python, ruby, and perl to interact with PHP
• Corresponding extensions do exist for those languages embedded in PHP
phpdbg
• Wait – there’s a debugger SAPI?
• Yes, yes there is
litespeed
• It is a SAPI
• The server just went open source…
• I’ve never tried it, but they take care of the SAPI 
Just connect to the app?
• Use a webserver to reverse proxy to webserver built into a framework?
• Smart to use a webserver that has already solved the hard stuff
• But the app/web framework on top needs to deal with
• HTTP keepalive?
• Gzip with caching?
• X-forwarded-for? Logging? Issues
• Load balancing and failover?
• HTTPS and caching?
• ulimit? Remember we’re opening up a bunch of sockets!
Well, PHP streams can do that 
Streams
Input and Output beyond the SAPI
What is a Stream?
• Access input and output generically
• Can write and read linearly
• May or may not be seekable
• Comes in chunks of data
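A minimal sketch of those properties, using the built-in `php://temp` stream (chosen here only because it needs no file or network): data is written in chunks, read back linearly, and the stream reports whether it is seekable.

```php
<?php
// Open a read/write temporary stream.
$stream = fopen('php://temp', 'w+');

// Write two chunks, one after the other.
fwrite($stream, "first chunk\n");
fwrite($stream, "second chunk\n");

// Ask the stream about itself; php://temp reports that it is seekable.
$meta = stream_get_meta_data($stream);
$seekable = $meta['seekable'];

// Seek back to the start and read linearly.
rewind($stream);
$first = fgets($stream);              // one line
$rest  = stream_get_contents($stream); // everything that is left

fclose($stream);
```

A network socket opened through the same API would accept the same `fwrite()`/`fgets()` calls but would not be seekable.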
How PHP Streams Work
Stream
Contexts
Stream
Wrapper
Stream
Filter
ALL IO
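To make the wrapper and filter pieces concrete, here is a small sketch: the `php://` wrapper supplies an in-memory stream and the built-in `string.toupper` filter transforms data on the way in. The choice of stream and filter is an assumption made to keep the example self-contained.

```php
<?php
// The php:// wrapper provides an in-memory stream.
$stream = fopen('php://memory', 'w+');

// Attach a built-in filter to the write side of the stream.
stream_filter_append($stream, 'string.toupper', STREAM_FILTER_WRITE);

// Everything written now passes through the filter first.
fwrite($stream, "all io goes through streams\n");

rewind($stream);
$result = stream_get_contents($stream);
fclose($stream);

echo $result;
```

`stream_get_wrappers()` and `stream_get_filters()` list what a given PHP build has registered, and a context created with `stream_context_create()` can be passed to calls such as `fopen()` to configure a wrapper.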
Definitions
• Socket
• Bidirectional network stream that speaks a protocol
• Transport
• Tells a network stream how to communicate
• Wrapper
• Tells a stream how to handle specific protocols and encodings
Built in Socket Transports
• tcp
• udp
• unix
• udg
• SSL extension
• ssl
• sslv2
• sslv3
• tls
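Which transports are available depends on the build and the loaded extensions, so it can be worth asking PHP directly. A short sketch using the built-in `stream_get_transports()`:

```php
<?php
// List the socket transports registered in this PHP build.
$transports = stream_get_transports();
sort($transports);

foreach ($transports as $transport) {
    echo $transport, PHP_EOL;
}

// The ssl/tls transports only appear when the openssl extension is loaded.
$hasTls = in_array('tls', $transports, true);
```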
You can write your own streams!
• You can do a stream wrapper in userland and register it
• But you need an extension to register them if they have a transport
• Extensions with streams include ssh, bzip2, openssl
• I’d really like the curl stream back (not with the compile flag, but curl://)
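A userland wrapper might look something like the sketch below. The `greet://` scheme and the `GreetStream` class are made up for illustration; `stream_wrapper_register()` and the `stream_*` method names are the hooks PHP calls on a wrapper class. Only the read-side methods are implemented here.

```php
<?php
// Hypothetical read-only wrapper: greet://NAME yields "Hello, NAME!".
class GreetStream
{
    /** @var resource|null Set by PHP when a stream context is passed. */
    public $context;

    private $data = '';
    private $pos = 0;

    public function stream_open(string $path, string $mode, int $options, ?string &$opened_path): bool
    {
        $name = substr($path, strlen('greet://'));
        $this->data = "Hello, {$name}!";
        $this->pos = 0;
        return true;
    }

    public function stream_read(int $count)
    {
        $chunk = (string) substr($this->data, $this->pos, $count);
        $this->pos += strlen($chunk);
        return $chunk;
    }

    public function stream_eof(): bool
    {
        return $this->pos >= strlen($this->data);
    }

    public function stream_stat()
    {
        // No real file metadata; an empty array avoids "not implemented" warnings.
        return [];
    }
}

// Register once, then use the scheme with the ordinary file functions.
if (!in_array('greet', stream_get_wrappers(), true)) {
    stream_wrapper_register('greet', GreetStream::class);
}

$greeting = file_get_contents('greet://World');
echo $greeting, PHP_EOL;
```

Once registered, the scheme works with `fopen()`, `file_get_contents()` and the other stream-aware functions, without any C code.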
Welcome to the Engine
Lexers and Parsers and Opcodes OH MY!
Lexer
• checks PHP’s spelling
• turns into tokens
• see token_get_all for what PHP sees
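A short sketch of what `token_get_all()` returns, assuming the tokenizer extension is available (it is enabled by default). The `token_names()` helper is hypothetical and just flattens the result into readable names.

```php
<?php
// Hypothetical helper: turn source code into a flat list of token names.
function token_names(string $code): array
{
    $names = [];
    foreach (token_get_all($code) as $token) {
        // Named tokens are arrays [id, text, line]; single characters are plain strings.
        $names[] = is_array($token) ? token_name($token[0]) : $token;
    }
    return $names;
}

print_r(token_names('<?php echo 1 + 2;'));
```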
Parser + AST
• checks PHP’s grammar
• E_PARSE means “bad phpish”
• creates AST
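On PHP 7 and later, most parse errors surface as a `ParseError` exception, so "bad grammar" can be detected without running the code. The sketch below assumes the tokenizer extension and its `TOKEN_PARSE` flag; `has_valid_syntax()` is a hypothetical helper name.

```php
<?php
// Hypothetical helper: check grammar without executing the code.
function has_valid_syntax(string $code): bool
{
    try {
        // TOKEN_PARSE runs the parser over the tokens and throws on bad grammar.
        token_get_all($code, TOKEN_PARSE);
        return true;
    } catch (ParseError $e) {
        return false;
    }
}

var_dump(has_valid_syntax('<?php echo 1 + 2;'));
var_dump(has_valid_syntax('<?php echo 1 +;'));
```

This only parses the code; nothing is compiled to opcodes or executed.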
Compiler
• Turns AST into Opcodes
• Allows for fancier grammar
• Opcodes can then be cached (opcache) skipping lex/parse/compile cycle
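Whether opcodes are actually being cached depends on the opcache extension being loaded and enabled for the current SAPI. A rough sketch for checking this from a script; `opcache_summary()` is a hypothetical helper, and note that CLI runs are governed by the separate `opcache.enable_cli` setting.

```php
<?php
// Hypothetical helper: describe the opcache state for this process.
function opcache_summary(): string
{
    if (!extension_loaded('Zend OPcache')) {
        return 'opcache extension not loaded';
    }

    // ini_get() returns a string such as "1" or "0".
    $enabled = filter_var(ini_get('opcache.enable'), FILTER_VALIDATE_BOOLEAN);

    return $enabled ? 'opcache enabled' : 'opcache loaded but disabled';
}

echo opcache_summary(), PHP_EOL;
```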
Opcodes
• dump with http://derickrethans.nl/projects.html
• machine readable language which the runtime understands
Engine (Virtual Machine)
• reads opcode
• does something
• zend extension can hook it!
• ???
• PROFIT
Extensions
How a simple design pattern made PHP more useful
“When I say that PHP is a ball of nails, basically,
PHP is just this piece of shit that you just put all
the parts together and you throw it against the wall
and it fucking sticks”
- Terry Chay
So what is an extension?
• Written in C or C++
• Compiled statically into the PHP binary or as a shared object (so/dylib/dll)
• Provides
• Bindings to a C or C++ library
• even embed other languages
• Code in C instead of PHP (speed)
• template engine
• Alter engine functionality
• debugging
So why an extension?
• add functionality from other languages (mainly C)
• speed
• to infinity and beyond!
• intercept the engine
• add debugging
• add threading capability
• the impossible (see: operator)
About Extensions
• Types
• Zend Extension
• PHP Module
• Sources
• Core Built in
• Core Default
• Core
• PECL
• Github and Other 3rd Party
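The two extension types can be listed from userland. A minimal sketch using built-in introspection functions; the variable names are arbitrary.

```php
<?php
// PHP modules loaded into this process (Core, standard, ...).
$modules = get_loaded_extensions();

// Passing true lists only Zend extensions (for example opcache or a debugger).
$zendExtensions = get_loaded_extensions(true);

// Check for a single extension and list the functions it provides.
$hasStandard = extension_loaded('standard');
$standardFunctions = get_extension_funcs('standard') ?: [];

printf(
    "%d modules, %d Zend extensions, %d functions in 'standard'\n",
    count($modules),
    count($zendExtensions),
    count($standardFunctions)
);
```

From the command line, `php -m` prints a similar list.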
“We need to foster a greater sense of community for people writing PHP
extensions, […] Quite what this means hasn't been decided, although one
of the major responsibilities is to spark up some community spirit, and
that is the purpose of this email.”
- Wez Furlong, 2003
What is PECL?
• PHP Extension Community Library
• The place for people to find PHP extensions
• No GPL code – license should be PHP license compatible (LGPL is ok but
not encouraged)
• http://news.php.net/article.php?group=php.pecl.dev&article=5
PECL Advantages
• Code reviews
• See https://wiki.php.net/internals/review_comments
• Help from other devs with internal API changes (if in PHP source control)
• https://svn.php.net/viewvc?view=revision&revision=297236
• Advertising and individual release cycles
• http://pecl.php.net/news/
• pecl command line integration
• actually just integration with the PEAR installer (which supports binaries/compiling) and
a unique pecl channel
• php.net documentation!
PECL Problems
• Has less oversight into code quality
• peclqa?
• not all source accessible
• no action taken for abandoned code
• still has “siberia” modules mixed with “need a maintainer”
• never enough help
• tests
• bug triaging
• maintainers
• code reviews
• docs!
• no composer integration
• Half the code in git, half in svn still, half… elsewhere …
“It’s really free as in pull
request”
- me
My extension didn’t make it faster!
• PHP is usually not the real bottleneck
• Do full stack profiling and benchmarking to see if PHP is the real bottleneck
• If PHP IS the real bottleneck you’re awesome – and you need to be writing stuff
in C or C++
• Most times your bottleneck is not PHP but I/O
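Before reaching for C, it helps to measure. The helper below is a crude timing sketch, not a substitute for a real profiler: `time_call()` is a hypothetical name, and the `md5()` workload is only a placeholder.

```php
<?php
// Hypothetical helper: average wall-clock seconds per call over $runs runs.
function time_call(callable $fn, int $runs = 1000): float
{
    $start = microtime(true);
    for ($i = 0; $i < $runs; $i++) {
        $fn();
    }
    return (microtime(true) - $start) / $runs;
}

// Placeholder workload; swap in the code path you suspect is slow.
$perCall = time_call(function () {
    return md5('payload');
});

printf("%.9f seconds per call\n", $perCall);
```

Timing the database query or HTTP call next to the pure-PHP part of a request usually shows quickly whether PHP or I/O dominates.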
What about other languages?
• Ruby gem
• Will compile and install
• Node’s npm
• Will compile and install
• Perl’s CPAN
• Written in special “xs” language
• Will compile and install
• Python
• Mixed bag? Distutils can install or grab a binary
FFI
Talk C without compiling
What is FFI?
• Foreign Function Interface
• Most things written in C use libffi
• https://github.com/libffi/libffi
Who has FFI?
• Java calls it JNI
• HHVM calls it HNI
• Python calls it “ctypes” (do not ask, stupidest name ever)
• C# calls it P/Invoke
• Ruby calls it FFI
• Perl has Inline::C (a bit of a mess)
• PHP calls it…
FFI
Oh wait…
• PHP’s FFI is rather broken
• PHP’s FFI has no maintainer
• It needs some TLC
• There’s MFFI but it’s not done
• https://github.com/mgdm/MFFI
• Are you interested and not afraid?
For the future?
• More SAPIs?
• Websockets
• PSR-7
• Other ideas?
• Fix server-tests.php so we can test SAPIs 
• Only CGI and CLI are currently tested well
• More extensions
• Guidelines for extensions
• Better documentation
• Builds + pickle + composer integration
About Me
• http://emsmith.net
• auroraeosrose@gmail.com
• twitter - @auroraeosrose
• IRC – freenode – auroraeosrose
• #phpmentoring
• https://joind.in/talk/67433


Editor's Notes

  • #2: PHP is an ever-evolving piece of software that is useful and easily hacked without touching the core. The features in PHP that make it so pluggable are not unique to PHP; other languages use them as well. Understanding how PHP is structured and works in comparison to other languages can help you understand what your code is doing. You don’t have to know C to see what’s going on “beneath the hood”.
  • #3: PHP actually has a pretty standard architecture for an interpreted language. We’ll take a look at how it compares to other similar languages. By similar I mean no compilation step, so Ruby, Python, and NodeJS. We’re not going to compare apples to oranges here; what use would that be? So don’t get me started on compiled languages (yes, that includes Java, C#, Go and Rust, sorry).
  • #4: It’s very important to understand the “big picture” of how your system fits together. You don’t have to know exactly how the parser works, but knowing you have a parser is important. It’s not that important to know how modules get loaded, but you should understand they exist and are loadable. A lot of people make excuses that “you don’t need to know how an internal combustion engine works to drive a car”. Well, that’s baloney. I would hope you know that when you turn the key, it uses electricity in your battery (which needs to be working) to ignite the gasoline (which you also need enough of) in your car. You need to know the rules of the road and how to use the brake. No, you don’t need to know what the right mix ratio is for fuel to air in your cylinder. But by George you better know that spark plugs make your engine go and they need to be replaced!
  • #5: So what do all these pieces do? The sapi and the streams do input and output control, no matter what we’re using for input or for output. The engine actually processes our PHP files – doing the work. Extensions add functionality to PHP (if you ever want an adventure – don’t just do --disable-all, try forcing all extensions off by fiddling with the .m4 files for date, pcre, standard, tokenizer, and a few others). You will have almost NO functionality in PHP itself. Streams are a bit broken because although parts of them are integrated into PHP’s sapi layer, other parts are integrated into the engine and still other parts into the standard extension (ugh).
  • #6: PHP is designed to be basic – the sapi says “run this script” and the engine runs it and returns output via the sapi. That’s it. The sapi gets to decide how things are handled. There are ways of accessing input and output aside from our main sapi process, but they’re started from the main process. They don’t exist independently.
  • #7: PHP gets away with mod_php because of its concept of shared nothing. Do you notice a trend yet? The rack specification and wsgi have a LOT in common. They’re weirdly (and purposely) very similar to PSR-7.
  • #8: Ok, at this point things are getting a wee bit ridiculous. Do you see the pattern? Yes, that’s right. There is no real difference between any of these things. That’s because they’re all solving the exact same issue.
  • #9: A lot of this has been reiterated for years, but these are the reasons that PHP IS different. What’s interesting to me is that these are often cited as DOWNSIDES to php. And the last one? Half the ruby and python developers I talk to do NOT understand what I’m talking about. Problem is, most of the PHP folks don’t either.
  • #10: Why is this important? This IS the thing. The choices you make depend on what you are doing! Do you want easy web dev and deployment? Do you want good horizontal scalability? Then PHP is your answer. Do you have another question? Well, that’s your problem.
  • #11: PHP is designed in a way so that not only do we have a pluggable infrastructure for adding functionality, we have a pluggable infrastructure for speaking input and output. “Which SAPI are you using?” is usually the number one question people are asked when they have a bug … and half can’t answer. SAPIs are (one of) the most underutilized features in the system, basically because few people know they exist and fewer can use them. And their API is not as robust as it could be because it hasn’t HAD to be.
  • #13: So cgi has a lot of benefits, actually. Except for the fact that it’s slow as dirt. Or molasses in July.
  • #14: Make sure to talk about what an MPM is in apache – a Multi-Processing Module
  • #15: Fastcgi really does work, and work well, mostly as advertised
  • #16: Basically smacks a process manager on top of the fastcgi protocol. This is not necessarily needed for webservers like IIS, which have features like worker pools, but for Nginx and apache raw fastcgi is … not fabulous. This adds a bunch of management features.
  • #17: Yes, the PHP cli is a sapi
  • #22: Yeah, so PHP actually has an http stream and code to help write a server in it that takes care of a LOT of hard stuff for you, just in case you really want to write your own appserver and then reverse proxy into it ;) IF YOU’RE CRAZY
  • #23: Streams are pretty much the most awesomesauce part of the package
  • #24: Streams are a huge underlying component of PHP. Streams were introduced with PHP 4.3.0 – they are old, but underuse means they can have rough edges… so TEST TEST TEST. But they are more powerful than almost anything else you can use. Why is this better? Lots and lots of data in small chunks lets you do large volumes without maxing out memory and cpu.
  • #25: All input and output comes into PHP. It gets pushed through a streams filter, then through the streams wrapper. At this point the stream context is available for the filter and wrapper to use. Streams themselves are the “objects” coming in. Wrappers are the “classes” defining how to deal with the stream.
  • #26: What is streamable behavior? We’ll get to that in a bit. Protocol: a set of rules which is used by computers to communicate with each other across a network. Resource: a resource is a special variable, holding a reference to an external resource. Talk about resources in PHP and talk about general protocols; get a list from the audience of protocols they can name (yes, http is a protocol). A socket is a special type of stream – pound this into their heads. A socket is an endpoint of communication to which a name can be bound. A socket has a type and one associated process. Sockets were designed to implement the client-server model for interprocess communication. In PHP, a wrapper ties the stream to the transport – so your http wrapper ties your PHP data to the http transport and tells it how to behave when reading and writing data.
  • #27: Internet Domain sockets expect a port number in addition to a target address. In the case of fsockopen() this is specified in a second parameter and therefore does not impact the formatting of transport URL. With stream_socket_client() and related functions as with traditional URLs however, the port number is specified as a suffix of the transport URL delimited by a colon. unix:// provides access to a socket stream connection in the Unix domain. udg:// provides an alternate transport to a Unix domain socket using the user datagram protocol. Unix domain sockets, unlike Internet domain sockets, do not expect a port number. In the case of fsockopen() the portno parameter should be set to 0.
  • #30: Lexical analysis: converts the source from a sequence of characters into a sequence of tokens
  • #31: Syntax analysis: analyzes a sequence of tokens to determine their grammatical structure
  • #32: 5.6 and 7+
  • #33: Generate bytecode based on the information gathered by analyzing the source code
  • #34: So zend is actually a “virtual machine”. It interprets OPCODES and does stuff with them: it reads each opcode and does a specific action – like a giant state machine.
  • #35: Extensions are the soul of what makes PHP great. If you take away the extensions there’s not a lot left in PHP. Sadly there isn’t a nice API to go with extensions, and that’s something PHP should address.
  • #36: PHP IS a glue language, it was not designed, it grew… and it was designed to be Cish and tie TIGHTLY to C without making the poor dev worry about the hard stuff! PHP has probably the sharpest delineation between a library (PHP code) and an extension (C code you have to compile) of any interpreted language out there. It is also arguably the most modular and extensible. This is not necessarily a bad thing, except that because so many people do PHP “on the cheap” they’re absolutely terrified of extensions! If C (or C++) can do it – PHP can be glued to it. The extension architecture introduced was probably one of the BEST decisions ever made for PHP! – And a bit of trivia – it didn’t exist before PHP 3.0
  • #37: I lied in the slide here – you can bind to non C/C++ stuff too. There are some clever bindings to .NET managed assemblies and cocoa bindings and all kinds of evil, but those are not a common use case and are hard to do. Extensions have to be compiled for the major.minor version you’re using, and there are lots of different flags and such that can make binaries incompatible. This is one of the strengths and weaknesses of a compiled language: it’s optimized for what it’s meant to run on but doesn’t work at all elsewhere.
  • #38: Porting C libraries in pure PHP could be done – but why in the world would you do that!! Unless you’re doing something incredibly stupid, C code is going to be faster than PHP. No matter what you do, there are parts of the engine you can’t touch in userland. Extensions do NOT have this limitation; you can do all kinds of evil. Yes, there are extensions for threading, debugging, compiling php opcodes, intercepting function calls, profiling, you name it. C is a POWERFUL thing.
  • #39: Two types of extensions: zend extensions can dig right into the engine itself, a php module cannot. A PHP extension can be both! (see xdebug) So what are “core default” extensions? These are the ones that are distributed with the PHP source code and are turned on by default. There are a few that are “always on” (as of 5.3): PCRE, date, standard (duh), reflection and spl. There are a bunch that are generally turned “on by default” – PHP tries to build them even if you don’t flag them as on (not having the libs necessary will turn them off). These include things like libxml2, zlib, iconv. There are still a bunch more included in the core source – some are excellent and some – some suck.
  • #41: “pickle” or “peh cul” (depending on your side of the pond) is the ongoing argument for pronunciation – bottom line is the logo IS a pickle yo ;) Benefits: code hosting and distribution (git! and github mirror!); pecl install (PEAR packaging and installer support); community; advertising. Split off from PEAR in 2003. Wez Furlong is the “instigator” (see the famous letter: http://news.php.net/article.php?group=php.pecl.dev&article=5). Many of the points raised still need work – mainly PECL needs manpower, bodies to do the work.
  • #45: I get “PHP is so slow” all the time. If you’re connecting to a remote memcache server, and a remote redis server, and a remote webservice, and a remote database… even if all that code is fast you’re screwed. Scale right, scale truly horizontal, with localhost connections for everything you can get away with. At the end of the day
  • #47: FFI is a cool concept, but the number one rule of FFI is… you can totally shoot yourself in the foot with it! The basic idea (and ruby and python use this extensively) is to “wrap” your C stuff with the FFI extension, and then write python/ruby code to unstupid the APIs.
  • #49: So many languages support this idea of calling into (usually) C code. Then they usually put a layer of regular code “on top” to make the APIs non painful. This can be useful; it also tends to be slower.
  • #50: Before you get into extension writing – if you just want an ffi wrapper and are just going to call the exact C calls from an existing library, why go to the trouble of writing an extension? ffi is pretty great but still a bit of a flaky extension. It’s identical to python’s “ctypes”, which is a stupid name; it’s really ffi. I hear all the time about how “great” python is because of ctypes; frankly, I beg to differ. Part of wrapping a C extension is translating the C calls into something far more “phpish”.
  • #51: oh ffi is cool and so needs a maintainer
  • #52: Some of my wishlist of how I’d love to see you get involved
  • #53: Would love to do some code for evil on this
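
As a companion to the FFI notes above (#47 to #50), here is a minimal sketch of the “wrap the C call, then put friendlier code on top” pattern. It is written with Python’s ctypes, since the notes compare PHP’s ffi to it; it is an illustration only, not code from the talk. Assumptions: a POSIX system where the C library can be loaded, and `byte_length` is a hypothetical wrapper name invented for this example.

```python
import ctypes
import ctypes.util

# Assumption: POSIX. find_library("c") locates libc; if it returns None,
# CDLL(None) falls back to the symbols already loaded into this process.
_libc = ctypes.CDLL(ctypes.util.find_library("c"))

# Step 1: the raw binding. Declare the C signature so ctypes converts
# arguments and the return value correctly:
#   size_t strlen(const char *s);
_libc.strlen.argtypes = [ctypes.c_char_p]
_libc.strlen.restype = ctypes.c_size_t


# Step 2: the layer of regular code "on top" (hypothetical helper).
def byte_length(text: str) -> int:
    """Return the UTF-8 byte length of text, measured by C's strlen."""
    data = text.encode("utf-8")
    # C strings end at the first NUL byte, so strlen would silently
    # under-count here. Fail loudly instead of returning a wrong answer.
    if b"\x00" in data:
        raise ValueError("embedded NUL byte: C strlen would stop early")
    return _libc.strlen(data)


if __name__ == "__main__":
    print(byte_length("hello"))
```

The raw `strlen` binding exposes C’s rules as they are (bytes in, NUL-terminated); the wrapper is where the API gets translated into something native to the calling language. That is the same translation step the notes describe for making a wrapped C library more “phpish”.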