NumGenius Ai is a GPU marketplace. Hosts sell GPU Resources on the marketplace. Hosts are responsible for:
Setup: installing Ubuntu, creating disk partitions, installing NVIDIA drivers, opening network ports on the router and installing the Vast hosting software.
Testing and troubleshooting all issues that can arise, such as driver conflicts, errors, bad GPUs, and bad network ports. Vast does not offer support for getting your machine working. There is a host discord with helpful members and the host-general channel is searchable for specific errors.
Managing the listings and GPU offers for rentals, including setting pricing and end dates for the offers
Planning for maintenance so that no client jobs are affected
Clients high expectations coming from AWS or GCP.
As a host, plan to offer 100% uptime for your machine during the contracted period.
Expect that the GPU is going to be used at close to max capacity for the rental period.
Ensure that your Internet, power source and heat dissipation systems are all functioning and that you have thought through how hosting will affect each one of those items.
The expiration date can be set in the hosting interface by clicking on the date field under expiration and selecting a date for when the listing contract will expire.
The CLI command to 'list machine' includes a field for end date, which is the same date.
Make sure to set an end date before listing your machine, or else the listing will not expire.
The "client end date" is the date of the longest client contract on a given machine.
When clicking on the set pricing button, there is a min GPU field. The min GPU field allows you to set the smallest grouping of GPU rentals available on your machine in powers of 2, or down to 1. For example, if you have an 8X 3090 and set min gpu to 2, clients can create instances with 2, 4, or 8 GPUs. If you set min gpus to 1, then clients can make instances with 1, 2, 4 or 8 GPUs.
The interruptible price allows for the host to set the minimum interruptible price for a client to rent. Interruptibles work in a bidding system: clients set a bid price for their instance; the current highest bid is the instance that runs, the others are paused. more info
Reserved Instance Discounts are a feature for clients which allows them to rent machines over a long period of time at a reduced price. The Reserved Discount Pricing Factor represents the maximum possible discount a user can achieve on your machines.
The reserved discount pricing factor is a decimal value that represents the maximum discount a client can achieve on your contract. Eg: 0.4 represents a maximum of 40% discount.
This discount is not static, but rather scales over time that the user rents the machine for. EG: A client will get a 20% discount rate for 1 month, and a 30% discount rate for 3 months on the default setting.
You can set this number yourself to 0 if you wish to opt out of this feature.
There are two supported ways to test your own machine. If you want to use the website GUI, you will need to setup a new account on a different email address, add a credit card and then find your machine and create instances on it like a client. This has the benefit of showing you the entire client experience. Testing the recommended Pytorch template is vital to ensure that SSH and Jupyter are working properly.
The preferred method of testing your own machine is to run the CLI. For Windows users, we suggest setting up WSL which will require you to install Ubuntu on your Windows machine and change your bios settings to allow virtualization. Then you can start an Ubuntu terminal and run the CLI.
To rent your own machine you will need to first search the offers with your machine ID to find the ID and then create an instance using that ID. The show machine command will show all your connected machines.
./vast show machines
Then for each machine id you will need to find the available instance IDs.
./vast search offers 'machine_id = <machine ID>'
You can see the number of available listings as well as information about the machine. This is the fastest way to also see all the offers listed for a given machine. The website GUI stacks similar offers and so it is not easy to see all the listings for a given machine. That is not a problem for the CLI.
Take the ID number from the first column and use that to create a free instance on your own machine. This example loads the latest pytorch image along with both jupyter and ssh direct launch modes.
You can then look at your instance tab to make sure that pytorch loaded correctly along with jupyter and ssh. Click on the <_ button to get the ssh command to connect to the instance. Test the direct ssh command. Click on the open button to test jupyter. If the button is stuck "connecting" then there is most likely a problem with the port configuration on the router in front of the machine. Once finished, destroy the instance.
The proper way to perform maintenance on your machine is to wait until all active contracts have expired or the machine is vacant.
Unlisting will prevent new contracts from starting on the machine. However if you have a current client rental, you could set the end date to the client end date to allow for other clients to create instances on that machine that expire at the same date. Once the end date is reached, you can then unlist the machine and then perform maintenance.
For unplanned or unscheduled maintenance, use the CLI and the schedule maint command. That will notify the client that you have to take the machine down and that they should save their work. You can specify a date and duration.