Permit FFI files in subdirectories #3601

PgBiel · 2024-09-08T23:11:03Z

Closes #1562

~~Still WIP (finishing the details regarding the duplicate module check)~~

Changes:

- FFI files in subdirectories are now copied. For JavaScript specifically, they can be used as `@external(javascript, "./file.mjs", "function")`. For Erlang, you'd use the module's name as usual.
- An additional check was included to ensure no `.erl` files with the same name exist within the project, as Erlang cannot have duplicate modules.
- Added support for directories to the in-memory filesystem. (Otherwise, the `is_directory` function would return true for files and `read_dir` would also list the directory itself, causing tests to enter infinite recursion and crash.)
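To illustrate the in-memory filesystem change, here is a minimal sketch of a store that tracks directories explicitly. The names `Entry` and `InMemoryFs` and the method signatures are assumptions made for illustration, not the compiler's actual types. The point is only that `is_directory` is true for directory entries alone and `read_dir` returns immediate children, never the directory itself.

```rust
use std::collections::BTreeMap;
use std::path::{Path, PathBuf};

/// Hypothetical entry kinds; not the compiler's real types.
#[derive(Debug, Clone, PartialEq)]
enum Entry {
    File(Vec<u8>),
    Directory,
}

/// Hypothetical in-memory filesystem keyed by full path.
#[derive(Debug, Default)]
struct InMemoryFs {
    entries: BTreeMap<PathBuf, Entry>,
}

impl InMemoryFs {
    /// Writes a file and records every ancestor as an explicit directory.
    fn write(&mut self, path: &Path, contents: &[u8]) {
        for ancestor in path.ancestors().skip(1) {
            // A relative path ends with an empty ancestor; skip it.
            if ancestor.as_os_str().is_empty() {
                continue;
            }
            self.entries
                .insert(ancestor.to_path_buf(), Entry::Directory);
        }
        self.entries
            .insert(path.to_path_buf(), Entry::File(contents.to_vec()));
    }

    /// True only for directory entries, never for files.
    fn is_directory(&self, path: &Path) -> bool {
        matches!(self.entries.get(path), Some(Entry::Directory))
    }

    /// Lists the immediate children of `dir`, excluding `dir` itself.
    fn read_dir(&self, dir: &Path) -> Vec<PathBuf> {
        self.entries
            .keys()
            .filter(|path| path.parent() == Some(dir))
            .cloned()
            .collect()
    }
}
```

Because a directory's own parent is never the directory, a recursive walk built on this `read_dir` always descends and cannot revisit its starting point.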

Potential improvements:

- I'd like to add explicit integration tests for using `@external` with relative paths in JS (I tested locally with `.mjs` files on Node.js, Deno and Bun and it worked, but it could be worth testing other potential edge cases). Let me know where the best spot in the codebase to do so would be.
- Consider having an error when a native file would overlap with a `.gleam` file's generated code (`.mjs` or `.erl`).
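The overlap check suggested in the last item could be sketched roughly as follows, for the JavaScript target only. This assumes that a Gleam module at `a/b.gleam` generates `a/b.mjs` at the same relative path; the function name `clashing_native_files` and its string-slice inputs are hypothetical and only for illustration.

```rust
use std::collections::HashSet;
use std::path::{Path, PathBuf};

/// Returns the native files whose relative path equals the `.mjs` output
/// assumed to be generated for one of the given Gleam modules.
/// All paths are relative to the package's `src` directory.
fn clashing_native_files(gleam_modules: &[&str], native_files: &[&str]) -> Vec<PathBuf> {
    // Assumption: `a/b.gleam` compiles to `a/b.mjs`.
    let generated: HashSet<PathBuf> = gleam_modules
        .iter()
        .map(|module| Path::new(module).with_extension("mjs"))
        .collect();

    native_files
        .iter()
        .map(|file| PathBuf::from(*file))
        .filter(|file| generated.contains(file))
        .collect()
}
```

An Erlang version would need a different comparison, since Erlang module names are flat rather than path-based.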

lpil

Thank you!

You'll see there are integration tests in the test directory, which can be updated with nested FFI files.

compiler-core/src/build/native_file_copier.rs

compiler-core/src/io/memory.rs

Ideally, `src/abc.gleam` and `src/abc.erl` would cause an error. However, this can actually occur legitimately when downloading a Hex package with precompiled `.erl` source files, which would trigger false positives. I wonder what the best way to detect this situation would be, but it doesn't seem trivial to solve.

- Test header file in separate subdir - Test parent folder ffi in Erlang

PgBiel · 2024-09-15T18:02:25Z

Problem: My handmade recursive directory walker is too naive and can enter infinite symlink loops. I'll probably add a new method to FileSystemReader where we can address this at the IO boundary, similarly to how it's done for gleam_source_files.
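For reference, one common way to make such a walker loop-safe is to remember the canonical path of every directory already visited. The sketch below uses only `std::fs`; it is not the code in this PR, and the name `walk_files` is hypothetical. It follows symlinks but descends into each real directory at most once.

```rust
use std::collections::HashSet;
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Collects every file under `root`, following symlinks but visiting each
/// real directory at most once so that symlink loops terminate.
fn walk_files(root: &Path) -> io::Result<Vec<PathBuf>> {
    let mut visited: HashSet<PathBuf> = HashSet::new();
    let mut pending = vec![root.to_path_buf()];
    let mut files = Vec::new();

    while let Some(dir) = pending.pop() {
        // `canonicalize` resolves symlinks, so a link that points back at an
        // ancestor maps to a path that is already in `visited`.
        if !visited.insert(fs::canonicalize(&dir)?) {
            continue;
        }
        for entry in fs::read_dir(&dir)? {
            let path = entry?.path();
            // `fs::metadata` follows symlinks, so linked directories count
            // as directories here.
            if fs::metadata(&path)?.is_dir() {
                pending.push(path);
            } else {
                files.push(path);
            }
        }
    }

    files.sort();
    Ok(files)
}
```

Note that this sketch returns an error on a dangling symlink, since `fs::metadata` fails for it; a real implementation would have to decide how to treat that case.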

- Prevent problems with infinite symlink loops

PgBiel · 2024-09-16T02:24:50Z

Alright, I've made the implementation more robust (it now also supports copying nested Erlang modules) and added detection of conflicting .gleam and .mjs files. Conflicting .gleam and .erl files are not detected yet due to the issue outlined here: #1562 (comment); we could leave this to a future PR. I've also added detection of conflicting .erl files in separate subpaths, more unit tests, and integration tests. 😄
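As a rough illustration of the check for conflicting .erl files in separate subpaths, the sketch below groups native files by file stem, on the assumption that an Erlang module's name is its file name without the extension, regardless of which subdirectory it lives in. The function name `duplicate_erlang_modules` and the returned tuple shape are hypothetical, not taken from the PR.

```rust
use std::collections::hash_map::Entry;
use std::collections::HashMap;
use std::path::PathBuf;

/// Returns `(module, first_path, clashing_path)` for every `.erl` file whose
/// module name was already claimed by an earlier file in the list.
fn duplicate_erlang_modules(native_files: &[PathBuf]) -> Vec<(String, PathBuf, PathBuf)> {
    let mut seen: HashMap<String, PathBuf> = HashMap::new();
    let mut duplicates = Vec::new();

    for path in native_files {
        // Only `.erl` files define modules; other native files are skipped.
        if path.extension().and_then(|ext| ext.to_str()) != Some("erl") {
            continue;
        }
        let Some(module) = path.file_stem().and_then(|stem| stem.to_str()) else {
            continue;
        };
        match seen.entry(module.to_string()) {
            Entry::Occupied(first) => {
                duplicates.push((module.to_string(), first.get().clone(), path.clone()));
            }
            Entry::Vacant(slot) => {
                slot.insert(path.clone());
            }
        }
    }

    duplicates
}
```

For example, `ffi.erl` and `nested/ffi.erl` would be reported as a clash on module `ffi`, while `nested/other.mjs` is ignored.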

inoas · 2024-09-16T13:45:12Z

Hello,

thanks for taking this on, this is lovely <3 for FFI interop!

In case this is desired and/or not yet considered:

Can we enforce some namespacing on Erlang, Elixir and JavaScript modules that maps to the directory structure, and otherwise raise a compiler error?

Edit:

... if possible, this might also make this check obsolete?

> An additional check was included to ensure no .erl files with the same name exist within the project, as Erlang cannot have duplicate modules.

PgBiel · 2024-09-16T16:12:28Z

I'm not sure that would be desirable since the main point of FFI is to have full control over what the final transpiled code does. Besides, we would have to parse all Erlang files, though I guess we could do with some regex in this case. But I think it's more a matter of flexibility, and the error just tells you if you ever mess it up somehow.

The error is also not perfect since I guess it's theoretically possible for you to shadow some module from some other package. But it's an attempt.

inoas · 2024-09-16T18:54:41Z

> I'm not sure that would be desirable since the main point of FFI is to have full control over what the final transpiled code does. Besides, we would have to parse all Erlang files, though I guess we could do with some regex in this case. But I think it's more a matter of flexibility, and the error just tells you if you ever mess it up somehow.
>
> The error is also not perfect since I guess it's theoretically possible for you to shadow some module from some other package. But it's an attempt.

Maybe it could warn.
I don't know... I think these FFI files will all be new, and setting up strictness is nice there. You can still build any Erlang/Elixir library as a dependency.

PgBiel · 2024-09-16T19:38:00Z

I think this is fine to be honest... Forcing people to name their modules like folder@folder2@something in every folder doesn't sound good or useful realistically. And it can clash either way.

lpil

Very nice, thank you.

It seems there is now a concept of directories in both the in-memory file system and the OS file system, and each is responsible for implementing traversal. I think it would make sense to remove this duplication and instead to have the trait know nothing about directory walking and then the walking logic depends on the trait and is shared between all implementations.

PgBiel · 2024-09-20T20:38:50Z

> I think it would make sense to remove this duplication and instead to have the trait know nothing about directory walking and then the walking logic depends on the trait and is shared between all implementations.

If I understood correctly, you are proposing making a dir walking function which uses only the trait's read_dir function (and other trait functions if needed), right? Though I'm not sure if we would want to reinvent the wheel here, given we already have a package which implements graph traversal logic to avoid loops etc. What if we compromise and simply add a walk_dir function to the trait and use that where applicable?

Although I guess a re-implementation will be needed if we add symlinks to the in-memory FS down the road, so we could alternatively consider searching for existing platform agnostic implementations and using that, or even just re-implement while attempting to avoid reinventing the wheel as much as possible, e.g. by using special data structure crates if needed. Still, I'm not fully aware of all edge cases to know whether we would be able to match existing OS-specific implementations in terms of not only performance but also correctness.

lpil · 2024-09-21T20:16:02Z

Yes, that's right. We should not have two implementations of the same thing, and removing a dependency is beneficial for maintenance. Having one less dependency to verify for the EU Cyber Resilience Act is nice too!

I'm not worried about correctness, seeing as this is a trivial algorithm. It will be fine.
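A minimal sketch of the shape discussed above, where the trait only exposes primitive operations and a single walker is shared by every implementation. The trait `DirReader`, its three methods, and the toy `MemoryFs` are all hypothetical names for illustration; the compiler's real trait has a different, larger interface.

```rust
use std::collections::{BTreeMap, BTreeSet};
use std::path::{Path, PathBuf};

/// Hypothetical minimal trait: it knows nothing about directory walking.
trait DirReader {
    fn read_dir(&self, dir: &Path) -> Vec<PathBuf>;
    fn is_directory(&self, path: &Path) -> bool;
    /// Resolves symlinks; an identity function where there are none.
    fn canonicalise(&self, path: &Path) -> PathBuf;
}

/// The one shared walker: depends only on the trait's primitives.
fn collect_files<F: DirReader>(fs: &F, root: &Path) -> Vec<PathBuf> {
    let mut visited = BTreeSet::new();
    let mut pending = vec![root.to_path_buf()];
    let mut files = Vec::new();

    while let Some(dir) = pending.pop() {
        // Skip directories already seen under their canonical path.
        if !visited.insert(fs.canonicalise(&dir)) {
            continue;
        }
        for path in fs.read_dir(&dir) {
            if fs.is_directory(&path) {
                pending.push(path);
            } else {
                files.push(path);
            }
        }
    }

    files.sort();
    files
}

/// Toy in-memory implementation: maps each directory to its children.
struct MemoryFs {
    dirs: BTreeMap<PathBuf, Vec<PathBuf>>,
}

impl DirReader for MemoryFs {
    fn read_dir(&self, dir: &Path) -> Vec<PathBuf> {
        self.dirs.get(dir).cloned().unwrap_or_default()
    }

    fn is_directory(&self, path: &Path) -> bool {
        self.dirs.contains_key(path)
    }

    fn canonicalise(&self, path: &Path) -> PathBuf {
        // No symlinks in this toy filesystem.
        path.to_path_buf()
    }
}
```

An OS-backed implementation would supply the same three methods using `std::fs`, and `collect_files` would stay unchanged.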

lpil reviewed Sep 10, 2024

lpil marked this pull request as draft September 10, 2024 15:22

PgBiel force-pushed the nested-js-ffi branch from 29b67b1 to 0880a75 Compare September 12, 2024 23:31

PgBiel changed the title ~~Permit JS FFI files in subdirectories~~ Permit FFI files in subdirectories Sep 13, 2024

PgBiel added 19 commits September 15, 2024 14:33

add directories to in-memory FS

4a1a2f8

fix test-package-compiler fs test

8ff6b99

copy native files in subdirs

9e911cf

in-memory FS correctness adjustments

080f853

fix parent check

7b19bbc

improve file type check

3649c2e

create output subdir if it doesn't exist

89a78f9

update changelog with js ffi change

8a786e2

adjustments to io/memory

e40a9e6

add erlang duplicate module check

a2b84af

detect clash between gleam and erl modules

e3a422e

detect clashing gleam and javascript modules

87ab2fd

separate .gleam codepath

1242e4b

add subdir_ffi integration tests

19df05c

add subdir_ffi test to ci

507d7f2

use EcoString, add subdir Erlang file tests

acd5a05

more robust subdir_ffi tests

161accd

- Test header file in separate subdir - Test parent folder ffi in Erlang

update changelog to include erl ffi files

cb3980e

PgBiel force-pushed the nested-js-ffi branch from a7903ba to cb3980e Compare September 15, 2024 17:36

PgBiel added 3 commits September 15, 2024 20:29

use walkdir for native file search

ff9dd50

- Prevent problems with infinite symlink loops

add module and native file clash error

b459ad0

add error for duplicate native erlang module

df8cbd8

PgBiel force-pushed the nested-js-ffi branch from 5da6877 to df8cbd8 Compare September 16, 2024 01:42

add mjs bug fix to changelog

243a6c5

final nested ffi improvements

7bc2b6d

PgBiel marked this pull request as ready for review September 16, 2024 02:22

lpil reviewed Sep 20, 2024

lpil marked this pull request as draft September 20, 2024 13:53

custom dirwalker

109dcf4
