Implement `math.factorial(n)` to calculate very large factorials with speed #2558

kmr-srbh · 2024-02-27T18:46:31Z

Overview

The current implementation of math.factorial() is naive.
It returns 0 for large factorials because of integer overflow.

Solution

The industry standard for calculating the factorial of a large number is complicated: it relies on prime number generation and swing numbers. These methods require a BigInt to store the large result.

One fast way to compute the result is to store the digits of the number in an array as strings, in reverse order, and multiply digit by digit. This PR implements that method.
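The digit-by-digit method described above can be sketched roughly as follows. This is a minimal illustration in plain Python, not the code from this PR: the name `factorial_digits` is hypothetical, and the sketch assumes the digits are held as small integers and only converted to text at the end.

```python
def factorial_digits(n: int) -> str:
    """Return n! as a decimal string using a reversed digit array (sketch)."""
    if n < 0:
        raise ValueError("factorial is not defined for negative values")
    # Least significant digit first; the list starts out representing 1.
    digits = [1]
    for factor in range(2, n + 1):
        carry = 0
        # Multiply every stored digit by the factor, propagating the carry.
        for i in range(len(digits)):
            product = digits[i] * factor + carry
            digits[i] = product % 10
            carry = product // 10
        # Any carry left over becomes new high-order digits.
        while carry > 0:
            digits.append(carry % 10)
            carry //= 10
    # Digits are stored in reverse, so flip them when building the string.
    return "".join(str(d) for d in reversed(digits))
```

For example, `factorial_digits(25)` builds the 26-digit value of 25! without ever holding it in a fixed-width integer. The cost is one pass over all digits per factor, which is why a larger base per array element is usually preferred for speed.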

I would support implementing the Swing Number algorithm if someone can understand and implement it correctly. The funny thing is that a Python implementation is available online.

Improvement

Works for large factorials
Works for very large factorials
Accurate

Integration tests have not been added due to the prerequisite below.

Prerequisite

While the function is complete, it depends on the string-to-int conversion in ASR (#2554) being merged. Currently it just prints the result to stdout and returns 0.
As it stands, values larger than 2^63 - 1 are handled by our BigInt module. It is broken and returns a pointer to the number instead. A vector<char> is required for storing and working with very large values.

Note: The digits are assigned one at a time because LPython does not currently support slice assignment such as my_list[0:4] = [1, 2, 3]. Once that is supported, the assignments can be shortened.


kmr-srbh · 2024-02-28T02:02:38Z

I think the errors are okay for now because the factorial function returns only 0. This will be addressed.

faze-geek · 2024-02-28T07:14:00Z

src/runtime/math.py

```diff
+    for idx in range(f_size - 1, -1, -1):
+        result += str(f[idx])
+    print(result)
+    return i64(0)
```

I do understand the approach: the final result is printed out using concatenation.
I have not looked into the prerequisites, but even after a conversion from string to int, how will you return an integer here? The largest integer annotation we support is i64, which has a maximum value of 2^63 - 1.
That is not large enough to store anything over 20!, so this will still overflow and return a garbage value.
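The 20! limit quoted above can be checked with a short sketch. It is written in plain Python (where integers do not overflow), not LPython, and the helper names `fits_in_i64` and `largest_factorial_in_i64` are hypothetical.

```python
import math

# Largest value representable in a signed 64-bit integer.
I64_MAX = 2**63 - 1


def fits_in_i64(value: int) -> bool:
    """Return True if value lies within the signed 64-bit range."""
    return -(2**63) <= value <= I64_MAX


def largest_factorial_in_i64() -> int:
    """Return the largest n for which n! still fits in a signed 64-bit integer."""
    n = 0
    while fits_in_i64(math.factorial(n + 1)):
        n += 1
    return n
```

Here 20! = 2432902008176640000 is below 2^63 - 1 = 9223372036854775807, while 21! = 51090942171709440000 is above it, so any fixed-width i64 return type stops being exact at n = 21.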

I have tried something similar in another project, let me know if there are any other approaches. Thanks :)

@faze-geek I understand your point. Internally, LPython handles values larger than 2^63 - 1 through the BigInt module we have. The part of the module that handles large numbers is broken and returns a pointer to the value instead. I worked on a related issue, #1356, and proposed implementing the vector<char> method for handling large values. Only then will we be able to handle values larger than 2^63 - 1.

I had forgotten to mention this prerequisite. Thanks for reminding me. I am adding it above.

> I have tried something similar in another project, let me know if there are any other approaches. Thanks :)

Oh yes there are! As I had mentioned above, the industry standard for scientific computing applications is to use the PrimeSwing algorithm. The Python implementation is available online, but I do not want to type something I do not understand. 😄

CPython's math.factorial(n) uses a related divide-and-conquer algorithm (binary splitting) rather than PrimeSwing itself.
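For readers unfamiliar with swing numbers, the core identity is n! = (⌊n/2⌋!)² · swing(n), where swing(n) = n! / (⌊n/2⌋!)². The sketch below is a simplified illustration only: it computes swing(n) with a plain recurrence instead of the prime factorisation that PrimeSwing uses, so it shows the structure but not the speed. The function names are hypothetical.

```python
def swing(n: int) -> int:
    """Return the swing number n! / ((n // 2)!)**2 via a simple recurrence."""
    s = 1
    for k in range(1, n + 1):
        if k % 2 == 1:
            # Odd step: swing(k) = swing(k - 1) * k
            s *= k
        else:
            # Even step: swing(k) = swing(k - 1) * 4 / k (always exact)
            s = s * 4 // k
    return s


def swing_factorial(n: int) -> int:
    """Return n! using n! = ((n // 2)!)**2 * swing(n), applied recursively."""
    if n < 0:
        raise ValueError("factorial is not defined for negative values")
    if n < 2:
        return 1
    half = swing_factorial(n // 2)
    return half * half * swing(n)
```

The recursion depth is only about log2(n), and most of the work moves into squaring large numbers. A real PrimeSwing implementation replaces `swing` with a product over prime powers, which is the part that needs a sieve and a fast BigInt.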

Thanks. Will surely explore this once the BigInt module gets working.

kmr-srbh · 2024-02-28T14:22:16Z

@certik On further thought, I realized that to handle values as large as those shown above, we need to introduce a new data type, BigInt, where we store the number as digits in base 2^32 in a vector. The vector<string> method is not that fast. What is your take on this? If not BigInt, what else can we do? Please guide me.

If yes, please guide me on how to go about creating a new data type.
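To make the base 2^32 idea concrete, here is a rough sketch of how such a representation could work. It is an illustration in plain Python, not a proposal for the actual LPython data type: the names `mul_small`, `factorial_limbs`, and `limbs_to_int` are hypothetical, and a list of integers stands in for the vector.

```python
BASE = 2**32  # each list element ("limb") holds one base-2**32 digit


def mul_small(limbs: list, factor: int) -> list:
    """Multiply a little-endian base-2**32 number by a small non-negative int."""
    result = []
    carry = 0
    for limb in limbs:
        product = limb * factor + carry
        result.append(product % BASE)  # low 32 bits stay in this limb
        carry = product // BASE        # the rest moves to the next limb
    while carry > 0:
        result.append(carry % BASE)
        carry //= BASE
    return result


def factorial_limbs(n: int) -> list:
    """Return n! as a little-endian list of base-2**32 limbs."""
    limbs = [1]
    for factor in range(2, n + 1):
        limbs = mul_small(limbs, factor)
    return limbs


def limbs_to_int(limbs: list) -> int:
    """Convert limbs back to a Python int (only used to check the result)."""
    value = 0
    for limb in reversed(limbs):
        value = value * BASE + limb
    return value
```

With this layout 20! needs two limbs and 21! needs three, whereas a decimal digit array needs 19 and 20 elements respectively, which is where the speed advantage comes from. In a compiled implementation each intermediate `product` would have to fit in a 64-bit unsigned integer, which holds as long as `factor` is below 2^32.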

kmr-srbh · 2024-05-01T05:06:36Z

Closing this as not planned for now.

Implement math.factorial(n) to calculate very large factorials with…

e52a1b3

… speed and accuracy

faze-geek reviewed Feb 28, 2024


Merge branch 'lcompilers:main' into modules

743c322

kmr-srbh changed the title ~~Implement math.factorial(n) to calculate very large factorials with speed and accuracy.~~ Implement math.factorial(n) to calculate very large factorials with speed Mar 4, 2024

kmr-srbh and others added 3 commits March 9, 2024 20:55

Use int as return type

e7857d0

Merge branch 'main' into modules

6445496

Revert setting return type to int

1478355

kmr-srbh closed this May 1, 2024

kmr-srbh deleted the modules branch May 1, 2024 05:06
