Is there any way to programmatically generate Python bytecode?

Question

I want to hack around with the Python interpreter and try creating a small DSL . Is there any module where I can do something like this theoretical code (similar to LINQ expression trees)?

expression_tree = Function(
    Print(
        String('Hello world!')
    )
 )
compile_to_bytecode(expression_tree)

Or would it just be easier to just generate Python source code? Could this be made easier by using C or SWIG or Cython?

Given the tremendous expressive power of OO languages (Python in particular) a DSL is fairly silly. Just write the Python. If you provide yourself good class definitions, you have a "DSL-like" Python and don't need this. — S.Lott
– S.Lott, Commented Sep 27, 2009 at 23:27
DSL's are never silly. We are more productive in DSL's. See "Code Complete" a Bible of software construction. — MicroservicesOnDDD
– MicroservicesOnDDD, Commented Jun 5, 2020 at 23:43

Alex Martelli · Accepted Answer · 2009-09-27 00:35:35Z

13

Working via ast and compiling the tree into bytecode, as another answer suggests, is probably simplest; generating sources and compiling those, almost as good.

However, to explore lower-level approaches, check out the links from this page; I've found byteplay especially useful (currently doesn't work on 2.6 nor 3.*, only 2.4 or 2.5, but I think fixing it for 2.6 should be easy, as currently discussed in its tracker). I have not used Phil Eby's similarly-featured BytecodeAssembler, but given the author's reputation I'm sure it's worth checking it out!

answered Sep 27, 2009 at 0:35

Alex Martelli

887k175 gold badges1.3k silver badges1.4k bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Thomas Baruchel · Accepted Answer · 2016-11-27 13:05:18Z

5

In Python 2.X, you would typically approach this with the compiler module and its ast sub-module (but note this module is deprecated since version 2.6). In Python 3.X, you would use just ast.

Both offer a compile() function that will go from source/AST to "a code object that can be executed by the exec statement or eval()."

edited Nov 27, 2016 at 13:05

Thomas Baruchel

7,5353 gold badges29 silver badges49 bronze badges

answered Sep 26, 2009 at 20:07

Mark Rushakoff

260k47 gold badges412 silver badges401 bronze badges

Comments

Lennart Regebro · Accepted Answer · 2009-09-26 21:11:44Z

2

It's easier to generate Python code and run it. If you do that you can also debug it more easily since there is actual source for the debugger to display. See also Malte Borchs article in the July issue of Python magazine, where he talks about this amongst other things.

answered Sep 26, 2009 at 21:11

Lennart Regebro

173k45 gold badges230 silver badges254 bronze badges

Comments

ACoolie · Accepted Answer · 2009-09-26 20:25:00Z

1

Fernando Meyer recently wrote a blog post explaining how to use the # coding directive to specify your own extensions to Python. Example (the actual format definition is in pyspec.py and tokenizer.py):

# coding: pyspec

class Bow:
    def shot(self):
        print "got shot"

    def score(self):
        return 5

describe Bowling:
    it "should score 0 for gutter game":
        bowling = Bow()
        bowling.shot()
        assert that bowling.score.should_be(5)

answered Sep 26, 2009 at 20:25

ACoolie

1,4491 gold badge12 silver badges16 bronze badges

Comments

Wojciech Kaczmarek · Accepted Answer · 2016-07-21 16:23:55Z

1

Updating for Python3 - there's also very interesting assembler zachariahreed/byteasm.

Actually the only one working for me in Py3. It has very nice & clean API:

>>> import byteasm, dis
>>> b = byteasm.FunctionBuilder()
>>> b.add_positional_arg('x')
>>> b.emit_load_const('Hello!')
>>> b.emit_load_fast('x')
>>> b.emit_build_tuple(2)
>>> b.emit_return_value()
>>> f = b.make('f')
>>> f
<function f at 0xb7012a4c>
>>> dis.dis(f)
  1           0 LOAD_CONST               0 ('Hello!')
              3 LOAD_FAST                0 (x)
              6 BUILD_TUPLE              2
              9 RETURN_VALUE
>>> f(108)
('Hello!', 108)

answered Jul 21, 2016 at 16:23

Wojciech Kaczmarek

2,3824 gold badges25 silver badges44 bronze badges

Comments

Nope · Accepted Answer · 2009-09-26 20:02:45Z

0

Check out the disassembler module found here:

http://docs.python.org/library/dis.html

answered Sep 26, 2009 at 20:02

Nope

36.2k42 gold badges100 silver badges119 bronze badges

2 Comments

Jason Baker Over a year ago

The only thing is that I was wanting to assemble bytecode, not disassemble it. :-)

Nope Over a year ago

ahh good point, sorry bout that ;) however the nice thing about the disassembler module is it lets you look at the bytecode that is generated as well as giving the specifics to the bytecode instructions.

Collectives™ on Stack Overflow

Is there any way to programmatically generate Python bytecode?

6 Answers 6

Comments

Comments

Comments

Comments

Comments

2 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

Comments

Comments

Comments

Comments

Comments

2 Comments

Your Answer

Sign up or log in

Post as a guest

Related